Automated identification of protein-ligand interaction features using Inductive Logic Programming: a hexose binding case study.
Published online
Journal Article
BACKGROUND: There is a need for automated methods to learn general features of the interactions of a ligand class with its diverse set of protein receptors. An appropriate machine learning approach is Inductive Logic Programming (ILP), which automatically generates comprehensible rules in addition to prediction. The development of ILP systems which can learn rules of the complexity required for studies on protein structure remains a challenge. In this work we use a new ILP system, ProGolem, and demonstrate its performance on learning features of hexose-protein interactions. RESULTS: The rules induced by ProGolem detect interactions mediated by aromatics and by planar-polar residues, in addition to less common features such as the aromatic sandwich. The rules also reveal a previously unreported dependency for residues cys and leu. They also specify interactions involving aromatic and hydrogen bonding residues. This paper shows that Inductive Logic Programming implemented in ProGolem can derive rules giving structural features of protein/ligand interactions. Several of these rules are consistent with descriptions in the literature. CONCLUSIONS: In addition to confirming literature results, ProGolem's model has a 10-fold cross-validated predictive accuracy that is superior, at the 95% confidence level, to another ILP system previously used to study protein/hexose interactions and is comparable with state-of-the-art statistical learners.
Full Text
Duke Authors
Cited Authors
- A Santos, JC; Nassif, H; Page, D; Muggleton, SH; E Sternberg, MJ
Published Date
- July 11, 2012
Published In
Volume / Issue
- 13 /
Start / End Page
- 162 -
PubMed ID
- 22783946
Pubmed Central ID
- 22783946
Electronic International Standard Serial Number (EISSN)
- 1471-2105
Digital Object Identifier (DOI)
- 10.1186/1471-2105-13-162
Language
- eng
Conference Location
- England