Reinvestigation of the Saccharomyces cerevisiae genome annotation by comparison to the genome of a related fungus: Ashbya gossypii.

Journal Article

BACKGROUND: The recently sequenced genome of the filamentous fungus Ashbya gossypii revealed remarkable similarities to that of the budding yeast Saccharomyces cerevisiae both at the level of homology and synteny (conservation of gene order). Thus, it became possible to reinvestigate the S. cerevisiae genome in the syntenic regions leading to an improved annotation. RESULTS: We have identified 23 novel S. cerevisiae open reading frames (ORFs) as syntenic homologs of A. gossypii genes; for all but one, homologs are present in other eukaryotes including humans. Other comparisons identified 13 overlooked introns and suggested 69 potential sequence corrections resulting in ORF extensions or ORF fusions with improved homology to the syntenic A. gossypii homologs. Of the proposed corrections, 25 were tested and confirmed by resequencing. In addition, homologs of nearly 1,000 S. cerevisiae ORFs, presently annotated as hypothetical, were found in A. gossypii at syntenic positions and can therefore be considered as authentic genes. Finally, we suggest that over 400 S. cerevisiae ORFs that overlap other ORFs in S. cerevisiae and for which no homolog can be detected in A. gossypii should be regarded as spurious. CONCLUSIONS: Although, the S. cerevisiae genome is rightly considered as one of the most accurately sequenced and annotated eukaryotic genomes, we have shown that it still benefits substantially from comparison to the completed sequence and syntenic gene map of A. gossypii, an evolutionarily related fungus. This type of approach will strongly support the annotation of more complex genomes such as the human and murine genomes.

Full Text

Duke Authors

Cited Authors

  • Brachat, S; Dietrich, FS; Voegeli, S; Zhang, Z; Stuart, L; Lerch, A; Gates, K; Gaffney, T; Philippsen, P

Published Date

  • 2003

Published In

Volume / Issue

  • 4 / 7

Start / End Page

  • R45 -

PubMed ID

  • 12844361

Electronic International Standard Serial Number (EISSN)

  • 1474-760X

Digital Object Identifier (DOI)

  • 10.1186/gb-2003-4-7-r45

Language

  • eng

Conference Location

  • England