Nucleosome occupancy information improves de novo motif discovery


Journal Article

A complete understanding of transcriptional regulatory processes in the cell requires identification of transcription factor binding sites on a genomewide scale. Unfortunately, these binding sites are typically short and degenerate, posing a significant statistical challenge: many more matches to known transcription factor binding sites occur in the genome than are actually functional. Chromatin structure is known to play an important role in guiding transcription factors to those sites that are functional. In particular, it has been shown that active regulatory regions are usually depleted of nucleosomes, thereby enabling transcription factors to bind DNA in those regions [I]. In this paper, we describe a novel algorithm which employs an informative prior over DNA sequence positions based on a discriminative view of nucleosome occupancy; the nucleosome occupancy information comes from a recently published computational model [2], When a Gibbs sampling algorithm with our informative prior is applied to yeast sequencesets identified by ChIP-chip [3], the correct motif is found in 50% more cases than with an uninformative uniform prior. Moreover, if nucleosome occupancy information is not available, our informative prior reduces to a new kind of prior that can exploit discriminative information in a purely generative setting. © Springer-Verlag Berlin Heidelberg 2007.

Full Text

Duke Authors

Cited Authors

  • Narlikar, L; Gordân, R; Hartemink, AJ

Published Date

  • January 1, 2007

Published In

Volume / Issue

  • 4453 LNBI /

Start / End Page

  • 107 - 121

Electronic International Standard Serial Number (EISSN)

  • 1611-3349

International Standard Serial Number (ISSN)

  • 0302-9743

Digital Object Identifier (DOI)

  • 10.1007/978-3-540-71681-5_8

Citation Source

  • Scopus