Skip to main content

Gene prediction and verification in a compact genome with numerous small introns.

Publication ,  Journal Article
Tenney, AE; Brown, RH; Vaske, C; Lodge, JK; Doering, TL; Brent, MR
Published in: Genome Res
November 2004

The genomes of clusters of related eukaryotes are now being sequenced at an increasing rate, creating a need for accurate, low-cost annotation of exon-intron structures. In this paper, we demonstrate that reverse transcription-polymerase chain reaction (RT-PCR) and direct sequencing based on predicted gene structures satisfy this need, at least for single-celled eukaryotes. The TWINSCAN gene prediction algorithm was adapted for the fungal pathogen Cryptococcus neoformans by using a precise model of intron lengths in combination with ungapped alignments between the genome sequences of the two closely related Cryptococcus varieties. This approach resulted in approximately 60% of known genes being predicted exactly right at every coding base and splice site. When previously unannotated TWINSCAN predictions were tested by RT-PCR and direct sequencing, 75% of targets spanning two predicted introns were amplified and produced high-quality sequence. When targets spanning the complete predicted open reading frame were tested, 72% of them amplified and produced high-quality sequence. We conclude that sequencing a small number of expressed sequence tags (ESTs) to provide training data, running TWINSCAN on an entire genome, and then performing RT-PCR and direct sequencing on all of its predictions would be a cost-effective method for obtaining an experimentally verified genome annotation.

Duke Scholars

Published In

Genome Res

DOI

ISSN

1088-9051

Publication Date

November 2004

Volume

14

Issue

11

Start / End Page

2330 / 2335

Location

United States

Related Subject Headings

  • Software
  • Sequence Analysis, DNA
  • Sequence Alignment
  • Reverse Transcriptase Polymerase Chain Reaction
  • Predictive Value of Tests
  • Introns
  • Genome, Fungal
  • Cryptococcus neoformans
  • Computational Biology
  • Bioinformatics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Tenney, A. E., Brown, R. H., Vaske, C., Lodge, J. K., Doering, T. L., & Brent, M. R. (2004). Gene prediction and verification in a compact genome with numerous small introns. Genome Res, 14(11), 2330–2335. https://doi.org/10.1101/gr.2816704
Tenney, Aaron E., Randall H. Brown, Charles Vaske, Jennifer K. Lodge, Tamara L. Doering, and Michael R. Brent. “Gene prediction and verification in a compact genome with numerous small introns.Genome Res 14, no. 11 (November 2004): 2330–35. https://doi.org/10.1101/gr.2816704.
Tenney AE, Brown RH, Vaske C, Lodge JK, Doering TL, Brent MR. Gene prediction and verification in a compact genome with numerous small introns. Genome Res. 2004 Nov;14(11):2330–5.
Tenney, Aaron E., et al. “Gene prediction and verification in a compact genome with numerous small introns.Genome Res, vol. 14, no. 11, Nov. 2004, pp. 2330–35. Pubmed, doi:10.1101/gr.2816704.
Tenney AE, Brown RH, Vaske C, Lodge JK, Doering TL, Brent MR. Gene prediction and verification in a compact genome with numerous small introns. Genome Res. 2004 Nov;14(11):2330–2335.

Published In

Genome Res

DOI

ISSN

1088-9051

Publication Date

November 2004

Volume

14

Issue

11

Start / End Page

2330 / 2335

Location

United States

Related Subject Headings

  • Software
  • Sequence Analysis, DNA
  • Sequence Alignment
  • Reverse Transcriptase Polymerase Chain Reaction
  • Predictive Value of Tests
  • Introns
  • Genome, Fungal
  • Cryptococcus neoformans
  • Computational Biology
  • Bioinformatics