Motif composition, conservation and condition-specificity of single and alternative transcription start sites in the Drosophila genome.

Published

Journal Article

BACKGROUND: Transcription initiation is a key component in the regulation of gene expression. mRNA 5' full-length sequencing techniques have enhanced our understanding of mammalian transcription start sites (TSSs), revealing different initiation patterns on a genomic scale. RESULTS: To identify TSSs in Drosophila melanogaster, we applied a hierarchical clustering strategy on available 5' expressed sequence tags (ESTs) and identified a high quality set of 5,665 TSSs for approximately 4,000 genes. We distinguished two initiation patterns: 'peaked' TSSs, and 'broad' TSS cluster groups. Peaked promoters were found to contain location-specific sequence elements; conversely, broad promoters were associated with non-location-specific elements. In alignments across other Drosophila genomes, conservation levels of sequence elements exceeded 90% within the melanogaster subgroup, but dropped considerably for distal species. Elements in broad promoters had lower levels of conservation than those in peaked promoters. When characterizing the distributions of ESTs, 64% of TSSs showed distinct associations to one out of eight different spatiotemporal conditions. Available whole-genome tiling array time series data revealed different temporal patterns of embryonic activity across the majority of genes with distinct alternative promoters. Many genes with maternally inherited transcripts were found to have alternative promoters utilized later in development. Core promoters of maternally inherited transcripts showed differences in motif composition compared to zygotically active promoters. CONCLUSIONS: Our study provides a comprehensive map of Drosophila TSSs and the conditions under which they are utilized. Distinct differences in motif associations with initiation pattern and spatiotemporal utilization illustrate the complex regulatory code of transcription initiation.

Full Text

Duke Authors

Cited Authors

  • Rach, EA; Yuan, H-Y; Majoros, WH; Tomancak, P; Ohler, U

Published Date

  • 2009

Published In

Volume / Issue

  • 10 / 7

Start / End Page

  • R73 -

PubMed ID

  • 19589141

Pubmed Central ID

  • 19589141

Electronic International Standard Serial Number (EISSN)

  • 1474-760X

Digital Object Identifier (DOI)

  • 10.1186/gb-2009-10-7-r73

Language

  • eng

Conference Location

  • England