Skip to main content

A proteogenomic survey of the Medicago truncatula genome.

Publication ,  Journal Article
Volkening, JD; Bailey, DJ; Rose, CM; Grimsrud, PA; Howes-Podoll, M; Venkateshwaran, M; Westphall, MS; Ané, J-M; Coon, JJ; Sussman, MR
Published in: Mol Cell Proteomics
October 2012

Peptide sequencing by computational assignment of tandem mass spectra to a database of putative protein sequences provides an independent approach to confirming or refuting protein predictions based on large-scale DNA and RNA sequencing efforts. This use of mass spectrometrically-derived sequence data for testing and refining predicted gene models has been termed proteogenomics. We report herein the application of proteogenomic methodology to a database of 10.9 million tandem mass spectra collected over a period of two years from proteolytically generated peptides isolated from the model legume Medicago truncatula. These spectra were searched against a database of predicted M. truncatula protein sequences generated from public databases, in silico gene model predictions, and a whole-genome six-frame translation. This search identified 78,647 distinct peptide sequences, and a comparison with the publicly available proteome from the recently published M. truncatula genome supported translation of 9,843 existing gene models and identified 1,568 novel peptides suggesting corrections or additions to the current annotations. Each supporting and novel peptide was independently validated using mRNA-derived deep sequencing coverage and an overall correlation of 93% between the two data types was observed. We have additionally highlighted examples of several aspects of structural annotation for which tandem MS provides unique evidence not easily obtainable through typical DNA or RNA sequencing. Proteogenomic analysis is a valuable and unique source of information for the structural annotation of genomes and should be included in such efforts to ensure that the genome models used by biologists mirror as accurately as possible what is present in the cell.

Duke Scholars

Published In

Mol Cell Proteomics

DOI

EISSN

1535-9484

Publication Date

October 2012

Volume

11

Issue

10

Start / End Page

933 / 944

Location

United States

Related Subject Headings

  • Sequence Analysis, DNA
  • Proteomics
  • Proteome
  • Plant Proteins
  • Peptides
  • Molecular Sequence Data
  • Medicago truncatula
  • Mass Spectrometry
  • Information Dissemination
  • Genome, Plant
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Volkening, J. D., Bailey, D. J., Rose, C. M., Grimsrud, P. A., Howes-Podoll, M., Venkateshwaran, M., … Sussman, M. R. (2012). A proteogenomic survey of the Medicago truncatula genome. Mol Cell Proteomics, 11(10), 933–944. https://doi.org/10.1074/mcp.M112.019471
Volkening, Jeremy D., Derek J. Bailey, Christopher M. Rose, Paul A. Grimsrud, Maegen Howes-Podoll, Muthusubramanian Venkateshwaran, Michael S. Westphall, Jean-Michel Ané, Joshua J. Coon, and Michael R. Sussman. “A proteogenomic survey of the Medicago truncatula genome.Mol Cell Proteomics 11, no. 10 (October 2012): 933–44. https://doi.org/10.1074/mcp.M112.019471.
Volkening JD, Bailey DJ, Rose CM, Grimsrud PA, Howes-Podoll M, Venkateshwaran M, et al. A proteogenomic survey of the Medicago truncatula genome. Mol Cell Proteomics. 2012 Oct;11(10):933–44.
Volkening, Jeremy D., et al. “A proteogenomic survey of the Medicago truncatula genome.Mol Cell Proteomics, vol. 11, no. 10, Oct. 2012, pp. 933–44. Pubmed, doi:10.1074/mcp.M112.019471.
Volkening JD, Bailey DJ, Rose CM, Grimsrud PA, Howes-Podoll M, Venkateshwaran M, Westphall MS, Ané J-M, Coon JJ, Sussman MR. A proteogenomic survey of the Medicago truncatula genome. Mol Cell Proteomics. 2012 Oct;11(10):933–944.

Published In

Mol Cell Proteomics

DOI

EISSN

1535-9484

Publication Date

October 2012

Volume

11

Issue

10

Start / End Page

933 / 944

Location

United States

Related Subject Headings

  • Sequence Analysis, DNA
  • Proteomics
  • Proteome
  • Plant Proteins
  • Peptides
  • Molecular Sequence Data
  • Medicago truncatula
  • Mass Spectrometry
  • Information Dissemination
  • Genome, Plant