Skip to main content

Learning protein-DNA interaction landscapes by integrating experimental data through computational models.

Publication ,  Journal Article
Zhong, J; Wasson, T; Hartemink, AJ
Published in: Bioinformatics
October 15, 2014

MOTIVATION: Transcriptional regulation is directly enacted by the interactions between DNA and many proteins, including transcription factors (TFs), nucleosomes and polymerases. A critical step in deciphering transcriptional regulation is to infer, and eventually predict, the precise locations of these interactions, along with their strength and frequency. While recent datasets yield great insight into these interactions, individual data sources often provide only partial information regarding one aspect of the complete interaction landscape. For example, chromatin immunoprecipitation (ChIP) reveals the binding positions of a protein, but only for one protein at a time. In contrast, nucleases like MNase and DNase can be used to reveal binding positions for many different proteins at once, but cannot easily determine the identities of those proteins. Currently, few statistical frameworks jointly model these different data sources to reveal an accurate, holistic view of the in vivo protein-DNA interaction landscape. RESULTS: Here, we develop a novel statistical framework that integrates different sources of experimental information within a thermodynamic model of competitive binding to jointly learn a holistic view of the in vivo protein-DNA interaction landscape. We show that our framework learns an interaction landscape with increased accuracy, explaining multiple sets of data in accordance with thermodynamic principles of competitive DNA binding. The resulting model of genomic occupancy provides a precise mechanistic vantage point from which to explore the role of protein-DNA interactions in transcriptional regulation. AVAILABILITY AND IMPLEMENTATION: The C source code for compete and Python source code for MCMC-based inference are available at http://www.cs.duke.edu/∼amink. CONTACT: amink@cs.duke.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

October 15, 2014

Volume

30

Issue

20

Start / End Page

2868 / 2874

Location

England

Related Subject Headings

  • Transcription, Genetic
  • Transcription Factors
  • Thermodynamics
  • Protein Binding
  • Nucleosomes
  • Models, Biological
  • Gene Expression Regulation
  • DNA-Binding Proteins
  • DNA
  • Computational Biology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhong, J., Wasson, T., & Hartemink, A. J. (2014). Learning protein-DNA interaction landscapes by integrating experimental data through computational models. Bioinformatics, 30(20), 2868–2874. https://doi.org/10.1093/bioinformatics/btu408
Zhong, Jianling, Todd Wasson, and Alexander J. Hartemink. “Learning protein-DNA interaction landscapes by integrating experimental data through computational models.Bioinformatics 30, no. 20 (October 15, 2014): 2868–74. https://doi.org/10.1093/bioinformatics/btu408.
Zhong J, Wasson T, Hartemink AJ. Learning protein-DNA interaction landscapes by integrating experimental data through computational models. Bioinformatics. 2014 Oct 15;30(20):2868–74.
Zhong, Jianling, et al. “Learning protein-DNA interaction landscapes by integrating experimental data through computational models.Bioinformatics, vol. 30, no. 20, Oct. 2014, pp. 2868–74. Pubmed, doi:10.1093/bioinformatics/btu408.
Zhong J, Wasson T, Hartemink AJ. Learning protein-DNA interaction landscapes by integrating experimental data through computational models. Bioinformatics. 2014 Oct 15;30(20):2868–2874.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

October 15, 2014

Volume

30

Issue

20

Start / End Page

2868 / 2874

Location

England

Related Subject Headings

  • Transcription, Genetic
  • Transcription Factors
  • Thermodynamics
  • Protein Binding
  • Nucleosomes
  • Models, Biological
  • Gene Expression Regulation
  • DNA-Binding Proteins
  • DNA
  • Computational Biology