Skip to main content
Journal cover image

Bayesian segmentation of protein secondary structure.

Publication ,  Journal Article
Schmidler, SC; Liu, JS; Brutlag, DL
Published in: Journal of computational biology : a journal of computational molecular cell biology
February 2000

We present a novel method for predicting the secondary structure of a protein from its amino acid sequence. Most existing methods predict each position in turn based on a local window of residues, sliding this window along the length of the sequence. In contrast, we develop a probabilistic model of protein sequence/structure relationships in terms of structural segments, and formulate secondary structure prediction as a general Bayesian inference problem. A distinctive feature of our approach is the ability to develop explicit probabilistic models for alpha-helices, beta-strands, and other classes of secondary structure, incorporating experimentally and empirically observed aspects of protein structure such as helical capping signals, side chain correlations, and segment length distributions. Our model is Markovian in the segments, permitting efficient exact calculation of the posterior probability distribution over all possible segmentations of the sequence using dynamic programming. The optimal segmentation is computed and compared to a predictor based on marginal posterior modes, and the latter is shown to provide significant improvement in predictive accuracy. The marginalization procedure provides exact secondary structure probabilities at each sequence position, which are shown to be reliable estimates of prediction uncertainty. We apply this model to a database of 452 nonhomologous structures, achieving accuracies as high as the best currently available methods. We conclude by discussing an extension of this framework to model nonlocal interactions in protein structures, providing a possible direction for future improvements in secondary structure prediction accuracy.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Journal of computational biology : a journal of computational molecular cell biology

DOI

EISSN

1557-8666

ISSN

1066-5277

Publication Date

February 2000

Volume

7

Issue

1-2

Start / End Page

233 / 248

Related Subject Headings

  • beta-Lactamases
  • Proteins
  • Protein Structure, Secondary
  • Molecular Sequence Data
  • Models, Statistical
  • Models, Molecular
  • Markov Chains
  • Biometry
  • Bioinformatics
  • Bayes Theorem
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Schmidler, S. C., Liu, J. S., & Brutlag, D. L. (2000). Bayesian segmentation of protein secondary structure. Journal of Computational Biology : A Journal of Computational Molecular Cell Biology, 7(1–2), 233–248. https://doi.org/10.1089/10665270050081496
Schmidler, S. C., J. S. Liu, and D. L. Brutlag. “Bayesian segmentation of protein secondary structure.Journal of Computational Biology : A Journal of Computational Molecular Cell Biology 7, no. 1–2 (February 2000): 233–48. https://doi.org/10.1089/10665270050081496.
Schmidler SC, Liu JS, Brutlag DL. Bayesian segmentation of protein secondary structure. Journal of computational biology : a journal of computational molecular cell biology. 2000 Feb;7(1–2):233–48.
Schmidler, S. C., et al. “Bayesian segmentation of protein secondary structure.Journal of Computational Biology : A Journal of Computational Molecular Cell Biology, vol. 7, no. 1–2, Feb. 2000, pp. 233–48. Epmc, doi:10.1089/10665270050081496.
Schmidler SC, Liu JS, Brutlag DL. Bayesian segmentation of protein secondary structure. Journal of computational biology : a journal of computational molecular cell biology. 2000 Feb;7(1–2):233–248.
Journal cover image

Published In

Journal of computational biology : a journal of computational molecular cell biology

DOI

EISSN

1557-8666

ISSN

1066-5277

Publication Date

February 2000

Volume

7

Issue

1-2

Start / End Page

233 / 248

Related Subject Headings

  • beta-Lactamases
  • Proteins
  • Protein Structure, Secondary
  • Molecular Sequence Data
  • Models, Statistical
  • Models, Molecular
  • Markov Chains
  • Biometry
  • Bioinformatics
  • Bayes Theorem