Skip to main content
Journal cover image

Predicting and annotating catalytic residues: an information theoretic approach.

Publication ,  Journal Article
Sterner, B; Singh, R; Berger, B
Published in: J Comput Biol
October 2007

We introduce a computational method to predict and annotate the catalytic residues of a protein using only its sequence information, so that we describe both the residues' sequence locations (prediction) and their specific biochemical roles in the catalyzed reaction (annotation). While knowing the chemistry of an enzyme's catalytic residues is essential to understanding its function, the challenges of prediction and annotation have remained difficult, especially when only the enzyme's sequence and no homologous structures are available. Our sequence-based approach follows the guiding principle that catalytic residues performing the same biochemical function should have similar chemical environments; it detects specific conservation patterns near in sequence to known catalytic residues and accordingly constrains what combination of amino acids can be present near a predicted catalytic residue. We associate with each catalytic residue a short sequence profile and define a Kullback-Leibler (KL) distance measure between these profiles, which, as we show, effectively captures even subtle biochemical variations. We apply the method to the class of glycohydrolase enzymes. This class includes proteins from 96 families with very different sequences and folds, many of which perform important functions. In a cross-validation test, our approach correctly predicts the location of the enzymes' catalytic residues with a sensitivity of 80% at a specificity of 99.4%, and in a separate cross-validation we also correctly annotate the biochemical role of 80% of the catalytic residues. Our results compare favorably to existing methods. Moreover, our method is more broadly applicable because it relies on sequence and not structure information; it may, furthermore, be used in conjunction with structure-based methods.

Duke Scholars

Published In

J Comput Biol

DOI

ISSN

1066-5277

Publication Date

October 2007

Volume

14

Issue

8

Start / End Page

1058 / 1073

Location

United States

Related Subject Headings

  • Proteomics
  • Protein Conformation
  • Models, Molecular
  • Information Theory
  • Humans
  • Glycoside Hydrolases
  • Enzymes
  • Databases, Protein
  • Computational Biology
  • Catalytic Domain
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sterner, B., Singh, R., & Berger, B. (2007). Predicting and annotating catalytic residues: an information theoretic approach. J Comput Biol, 14(8), 1058–1073. https://doi.org/10.1089/cmb.2007.0042
Sterner, Beckett, Rohit Singh, and Bonnie Berger. “Predicting and annotating catalytic residues: an information theoretic approach.J Comput Biol 14, no. 8 (October 2007): 1058–73. https://doi.org/10.1089/cmb.2007.0042.
Sterner B, Singh R, Berger B. Predicting and annotating catalytic residues: an information theoretic approach. J Comput Biol. 2007 Oct;14(8):1058–73.
Sterner, Beckett, et al. “Predicting and annotating catalytic residues: an information theoretic approach.J Comput Biol, vol. 14, no. 8, Oct. 2007, pp. 1058–73. Pubmed, doi:10.1089/cmb.2007.0042.
Sterner B, Singh R, Berger B. Predicting and annotating catalytic residues: an information theoretic approach. J Comput Biol. 2007 Oct;14(8):1058–1073.
Journal cover image

Published In

J Comput Biol

DOI

ISSN

1066-5277

Publication Date

October 2007

Volume

14

Issue

8

Start / End Page

1058 / 1073

Location

United States

Related Subject Headings

  • Proteomics
  • Protein Conformation
  • Models, Molecular
  • Information Theory
  • Humans
  • Glycoside Hydrolases
  • Enzymes
  • Databases, Protein
  • Computational Biology
  • Catalytic Domain