Skip to main content

Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification.

Publication ,  Journal Article
Ohler, U; Yekta, S; Lim, LP; Bartel, DP; Burge, CB
Published in: RNA
September 2004

MicroRNAs are approximately 22-nucleotide (nt) RNAs processed from foldback segments of endogenous transcripts. Some are known to play important gene regulatory roles during animal and plant development by pairing to the messages of protein-coding genes to direct the post-transcriptional repression of these messages. Previously, we developed a computational method called MiRscan, which scores features related to the foldbacks, and used this algorithm to identify new miRNA genes in the nematode Caenorhabditis elegans. In the present study, to identify sequences that might be involved in processing or transcriptional regulation of miRNAs, we aligned sequences upstream and downstream of orthologous nematode miRNA foldbacks. These alignments showed a pronounced peak in sequence conservation about 200 bp upstream of the miRNA foldback and revealed a highly significant sequence motif, with consensus CTCCGCCC, that is present upstream of almost all independently transcribed nematode miRNA genes. Scoring the pattern of upstream/downstream conservation, the occurrence of this sequence motif, and orthology of host genes for intronic miRNA candidates, yielded substantial improvements in the accuracy of MiRscan. Nine new C. elegans miRNA gene candidates were validated using a PCR-sequencing protocol. As previously seen for bacterial RNA genes, sequence features outside of the RNA secondary structure can therefore be very useful for the computational identification of eukaryotic noncoding RNA genes. The total number of confidently identified nematode miRNAs now approaches 100. The improved analysis supports our previous assertion that miRNA gene identification is nearing completion in C. elegans with apparently no more than 20 miRNA genes now remaining to be identified.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

RNA

DOI

ISSN

1355-8382

Publication Date

September 2004

Volume

10

Issue

9

Start / End Page

1309 / 1322

Location

United States

Related Subject Headings

  • Transcription, Genetic
  • RNA, Untranslated
  • RNA, Helminth
  • RNA Processing, Post-Transcriptional
  • MicroRNAs
  • Genes, Helminth
  • Gene Expression Regulation
  • Evolution, Molecular
  • Developmental Biology
  • Conserved Sequence
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ohler, U., Yekta, S., Lim, L. P., Bartel, D. P., & Burge, C. B. (2004). Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification. RNA, 10(9), 1309–1322. https://doi.org/10.1261/rna.5206304
Ohler, Uwe, Soraya Yekta, Lee P. Lim, David P. Bartel, and Christopher B. Burge. “Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification.RNA 10, no. 9 (September 2004): 1309–22. https://doi.org/10.1261/rna.5206304.
Ohler U, Yekta S, Lim LP, Bartel DP, Burge CB. Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification. RNA. 2004 Sep;10(9):1309–22.
Ohler, Uwe, et al. “Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification.RNA, vol. 10, no. 9, Sept. 2004, pp. 1309–22. Pubmed, doi:10.1261/rna.5206304.
Ohler U, Yekta S, Lim LP, Bartel DP, Burge CB. Patterns of flanking sequence conservation and a characteristic upstream motif for microRNA gene identification. RNA. 2004 Sep;10(9):1309–1322.

Published In

RNA

DOI

ISSN

1355-8382

Publication Date

September 2004

Volume

10

Issue

9

Start / End Page

1309 / 1322

Location

United States

Related Subject Headings

  • Transcription, Genetic
  • RNA, Untranslated
  • RNA, Helminth
  • RNA Processing, Post-Transcriptional
  • MicroRNAs
  • Genes, Helminth
  • Gene Expression Regulation
  • Evolution, Molecular
  • Developmental Biology
  • Conserved Sequence