Skip to main content
construction release_alert
Scholars@Duke will be down for maintenance for approximately one hour starting Tuesday, 11/11 @1pm ET
cancel

Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation.

Publication ,  Journal Article
Zhang, W; Roy Burman, SS; Chen, J; Donovan, KA; Cao, Y; Shu, C; Zhang, B; Zeng, Z; Gu, S; Zhang, Y; Li, D; Fischer, ES; Tokheim, C; Shirley Liu, X
Published in: Genomics Proteomics Bioinformatics
October 2022

Targeted protein degradation (TPD) has rapidly emerged as a therapeutic modality to eliminate previously undruggable proteins by repurposing the cell's endogenous protein degradation machinery. However, the susceptibility of proteins for targeting by TPD approaches, termed "degradability", is largely unknown. Here, we developed a machine learning model, model-free analysis of protein degradability (MAPD), to predict degradability from features intrinsic to protein targets. MAPD shows accurate performance in predicting kinases that are degradable by TPD compounds [with an area under the precision-recall curve (AUPRC) of 0.759 and an area under the receiver operating characteristic curve (AUROC) of 0.775] and is likely generalizable to independent non-kinase proteins. We found five features with statistical significance to achieve optimal prediction, with ubiquitination potential being the most predictive. By structural modeling, we found that E2-accessible ubiquitination sites, but not lysine residues in general, are particularly associated with kinase degradability. Finally, we extended MAPD predictions to the entire proteome to find 964 disease-causing proteins (including proteins encoded by 278 cancer genes) that may be tractable to TPD drug development.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Genomics Proteomics Bioinformatics

DOI

EISSN

2210-3244

Publication Date

October 2022

Volume

20

Issue

5

Start / End Page

882 / 898

Location

England

Related Subject Headings

  • Ubiquitination
  • Proteome
  • Proteolysis
  • Machine Learning
  • Lysine
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhang, W., Roy Burman, S. S., Chen, J., Donovan, K. A., Cao, Y., Shu, C., … Shirley Liu, X. (2022). Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation. Genomics Proteomics Bioinformatics, 20(5), 882–898. https://doi.org/10.1016/j.gpb.2022.11.008
Zhang, Wubing, Shourya S. Roy Burman, Jiaye Chen, Katherine A. Donovan, Yang Cao, Chelsea Shu, Boning Zhang, et al. “Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation.Genomics Proteomics Bioinformatics 20, no. 5 (October 2022): 882–98. https://doi.org/10.1016/j.gpb.2022.11.008.
Zhang W, Roy Burman SS, Chen J, Donovan KA, Cao Y, Shu C, et al. Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation. Genomics Proteomics Bioinformatics. 2022 Oct;20(5):882–98.
Zhang, Wubing, et al. “Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation.Genomics Proteomics Bioinformatics, vol. 20, no. 5, Oct. 2022, pp. 882–98. Pubmed, doi:10.1016/j.gpb.2022.11.008.
Zhang W, Roy Burman SS, Chen J, Donovan KA, Cao Y, Shu C, Zhang B, Zeng Z, Gu S, Zhang Y, Li D, Fischer ES, Tokheim C, Shirley Liu X. Machine Learning Modeling of Protein-intrinsic Features Predicts Tractability of Targeted Protein Degradation. Genomics Proteomics Bioinformatics. 2022 Oct;20(5):882–898.

Published In

Genomics Proteomics Bioinformatics

DOI

EISSN

2210-3244

Publication Date

October 2022

Volume

20

Issue

5

Start / End Page

882 / 898

Location

England

Related Subject Headings

  • Ubiquitination
  • Proteome
  • Proteolysis
  • Machine Learning
  • Lysine
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
  • 08 Information and Computing Sciences