Skip to main content

Bayesian protein structure alignment

Publication ,  Journal Article
Rodriguez, A; Schmidler, SC
Published in: Annals of Applied Statistics
December 1, 2014

The analysis of the three-dimensional structure of proteins is an important topic in molecular biochemistry. Structure plays a critical role in defining the function of proteins and is more strongly conserved than amino acid sequence over evolutionary timescales. A key challenge is the identification and evaluation of structural similarity between proteins; such analysis can aid in understanding the role of newly discovered proteins and help elucidate evolutionary relationships between organisms. Computational biologists have developed many clever algorithmic techniques for comparing protein structures, however, all are based on heuristic optimization criteria, making statistical interpretation somewhat difficult. Here we present a fully probabilistic framework for pairwise structural alignment of proteins. Our approach has several advantages, including the ability to capture alignment uncertainty and to estimate key “gap” parameters which critically affect the quality of the alignment. We show that several existing alignment methods arise as maximum a posteriori estimates under specific choices of prior distributions and error models. Our probabilistic framework is also easily extended to incorporate additional information, which we demonstrate by including primary sequence information to generate simultaneous sequence-structure alignments that can resolve ambiguities obtained using structure alone. This combined model also provides a natural approach for the difficult task of estimating evolutionary distance based on structural alignments. The model is illustrated by comparison with well-established methods on several challenging protein alignment examples.

Duke Scholars

Published In

Annals of Applied Statistics

DOI

EISSN

1941-7330

ISSN

1932-6157

Publication Date

December 1, 2014

Volume

8

Issue

4

Start / End Page

2068 / 2095

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Rodriguez, A., & Schmidler, S. C. (2014). Bayesian protein structure alignment. Annals of Applied Statistics, 8(4), 2068–2095. https://doi.org/10.1214/14-AOAS780
Rodriguez, A., and S. C. Schmidler. “Bayesian protein structure alignment.” Annals of Applied Statistics 8, no. 4 (December 1, 2014): 2068–95. https://doi.org/10.1214/14-AOAS780.
Rodriguez A, Schmidler SC. Bayesian protein structure alignment. Annals of Applied Statistics. 2014 Dec 1;8(4):2068–95.
Rodriguez, A., and S. C. Schmidler. “Bayesian protein structure alignment.” Annals of Applied Statistics, vol. 8, no. 4, Dec. 2014, pp. 2068–95. Scopus, doi:10.1214/14-AOAS780.
Rodriguez A, Schmidler SC. Bayesian protein structure alignment. Annals of Applied Statistics. 2014 Dec 1;8(4):2068–2095.

Published In

Annals of Applied Statistics

DOI

EISSN

1941-7330

ISSN

1932-6157

Publication Date

December 1, 2014

Volume

8

Issue

4

Start / End Page

2068 / 2095

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 1403 Econometrics
  • 0104 Statistics