Skip to main content

Using dynamic programming to create isotopic distribution maps from mass spectra.

Publication ,  Conference
McIlwain, S; Page, D; Huttlin, EL; Sussman, MR
Published in: Bioinformatics
July 1, 2007

MOTIVATION: This article presents a method to identify the isotopic distributions within a mass spectrum using a probabilistic classifier supplemented with dynamic programming. Such a system is needed for a variety of purposes, including generating robust and meaningful features from mass spectra to be used in classification. RESULTS: The primary result of this article is that the dynamic programming approach significantly improves sensitivity, without harming specificity, of a probabilistic classifier for identifying the isotopic distributions. When annotating isotopic distributions where an expert has performed the initial 'peak-picking' (removal of noise peaks), the dynamic programming approach gives a true positive rate of 96% and a false positive rate of 0.0%, whereas the classifier alone has a true positive rate of only 47% when the false positive rate is 0.0%. When annotating isotopic distributions in machine peak-picked spectra, which may contain many noise peaks, the dynamic programming approach gives a true positive rate of only 22.0%, but it still keeps a low false positive rate of 1.0% and still outperforms the classifier alone. It is important to note that all these rates are when we require exact matches with the distributions in annotated spectra; in our evaluation a distribution is considered 'entirely incorrect' if it is missing even one peak or contains even one extraneous peak. We compared to the THRASH and AID-MS systems using a looser requirement: correctly identifying the distribution that contains the mono-isotopic mass. Under this measure, our dynamic programming approach achieves a true positive rate of 82% and a false positive rate of 1%, which again outperforms the classifier alone. The dynamic programming approach ends up being more conservative than THRASH and AID-MS, yielding both fewer true and false peaks, but the F-score of the dynamic programming approach is significantly better than those of THRASH and AID-MS. All results were obtained with 10-fold cross-validation of 99 sections of mass spectra with a total of 214 hand-annotated isotopic distributions. AVAILABILITY: Programs are available via http://www.cs.wisc.edu/~mcilwain/IDM.

Duke Scholars

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

July 1, 2007

Volume

23

Issue

13

Start / End Page

i328 / i336

Location

England

Related Subject Headings

  • Sensitivity and Specificity
  • Reproducibility of Results
  • Radioisotopes
  • Peptide Mapping
  • Pattern Recognition, Automated
  • Models, Chemical
  • Mass Spectrometry
  • Isotope Labeling
  • Computer Simulation
  • Bioinformatics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
McIlwain, S., Page, D., Huttlin, E. L., & Sussman, M. R. (2007). Using dynamic programming to create isotopic distribution maps from mass spectra. In Bioinformatics (Vol. 23, pp. i328–i336). England. https://doi.org/10.1093/bioinformatics/btm198
McIlwain, Sean, David Page, Edward L. Huttlin, and Michael R. Sussman. “Using dynamic programming to create isotopic distribution maps from mass spectra.” In Bioinformatics, 23:i328–36, 2007. https://doi.org/10.1093/bioinformatics/btm198.
McIlwain S, Page D, Huttlin EL, Sussman MR. Using dynamic programming to create isotopic distribution maps from mass spectra. In: Bioinformatics. 2007. p. i328–36.
McIlwain, Sean, et al. “Using dynamic programming to create isotopic distribution maps from mass spectra.Bioinformatics, vol. 23, no. 13, 2007, pp. i328–36. Pubmed, doi:10.1093/bioinformatics/btm198.
McIlwain S, Page D, Huttlin EL, Sussman MR. Using dynamic programming to create isotopic distribution maps from mass spectra. Bioinformatics. 2007. p. i328–i336.

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

July 1, 2007

Volume

23

Issue

13

Start / End Page

i328 / i336

Location

England

Related Subject Headings

  • Sensitivity and Specificity
  • Reproducibility of Results
  • Radioisotopes
  • Peptide Mapping
  • Pattern Recognition, Automated
  • Models, Chemical
  • Mass Spectrometry
  • Isotope Labeling
  • Computer Simulation
  • Bioinformatics