Skip to main content
Journal cover image

Contrastive learning in protein language space predicts interactions between drugs and protein targets.

Publication ,  Journal Article
Singh, R; Sledzieski, S; Bryson, B; Cowen, L; Berger, B
Published in: Proc Natl Acad Sci U S A
June 13, 2023

Sequence-based prediction of drug-target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models ("PLex") and employing a protein-anchored contrastive coembedding ("Con") to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (KD = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug-target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Proc Natl Acad Sci U S A

DOI

EISSN

1091-6490

Publication Date

June 13, 2023

Volume

120

Issue

24

Start / End Page

e2220778120

Location

United States

Related Subject Headings

  • Proteins
  • Language
  • Humans
  • Drug Evaluation, Preclinical
  • Drug Discovery
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Singh, R., Sledzieski, S., Bryson, B., Cowen, L., & Berger, B. (2023). Contrastive learning in protein language space predicts interactions between drugs and protein targets. Proc Natl Acad Sci U S A, 120(24), e2220778120. https://doi.org/10.1073/pnas.2220778120
Singh, Rohit, Samuel Sledzieski, Bryan Bryson, Lenore Cowen, and Bonnie Berger. “Contrastive learning in protein language space predicts interactions between drugs and protein targets.Proc Natl Acad Sci U S A 120, no. 24 (June 13, 2023): e2220778120. https://doi.org/10.1073/pnas.2220778120.
Singh R, Sledzieski S, Bryson B, Cowen L, Berger B. Contrastive learning in protein language space predicts interactions between drugs and protein targets. Proc Natl Acad Sci U S A. 2023 Jun 13;120(24):e2220778120.
Singh, Rohit, et al. “Contrastive learning in protein language space predicts interactions between drugs and protein targets.Proc Natl Acad Sci U S A, vol. 120, no. 24, June 2023, p. e2220778120. Pubmed, doi:10.1073/pnas.2220778120.
Singh R, Sledzieski S, Bryson B, Cowen L, Berger B. Contrastive learning in protein language space predicts interactions between drugs and protein targets. Proc Natl Acad Sci U S A. 2023 Jun 13;120(24):e2220778120.
Journal cover image

Published In

Proc Natl Acad Sci U S A

DOI

EISSN

1091-6490

Publication Date

June 13, 2023

Volume

120

Issue

24

Start / End Page

e2220778120

Location

United States

Related Subject Headings

  • Proteins
  • Language
  • Humans
  • Drug Evaluation, Preclinical
  • Drug Discovery