Skip to main content

Learning a CoNCISE Language for Small-Molecule Binding

Publication ,  Chapter
Erden, M; Devkota, K; Varghese, L; Cowen, L; Singh, R
January 1, 2025

Rapid advances in deep learning have improved in silico methods for drug-target interaction (DTI) prediction. However, current methods struggle to scale to catalogs listing billions of commercially-available small molecules. Here, we introduce CoNCISE, a method that accelerates DTI prediction by 23 orders of magnitude while maintaining high accuracy. CoNCISE employs a novel vector-quantized codebook approach and residual-learning-based training of hierarchical codes. Strikingly, we find that binding-specificity information in the small molecule space can be compressed into just 15 bits per compound, grouping all small molecules into 32,768 hierarchically-organized binding categories. Our DTI architecture combines these compact ligand representations with fixed-length protein embeddings in a cross-attention framework, achieving state-of-the-art prediction accuracy at unprecedented speed. We demonstrate CoNCISE’s practical utility by indexing 6.4 billion ligands from the Enamine dataset, enabling researchers to query vast chemical libraries against a protein target in seconds.

Duke Scholars

DOI

Publication Date

January 1, 2025

Volume

15647 LNBI

Start / End Page

260 / 263

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Erden, M., Devkota, K., Varghese, L., Cowen, L., & Singh, R. (2025). Learning a CoNCISE Language for Small-Molecule Binding (Vol. 15647 LNBI, pp. 260–263). https://doi.org/10.1007/978-3-031-90252-9_17
Erden, M., K. Devkota, L. Varghese, L. Cowen, and R. Singh. “Learning a CoNCISE Language for Small-Molecule Binding,” 15647 LNBI:260–63, 2025. https://doi.org/10.1007/978-3-031-90252-9_17.
Erden M, Devkota K, Varghese L, Cowen L, Singh R. Learning a CoNCISE Language for Small-Molecule Binding. In 2025. p. 260–3.
Erden, M., et al. Learning a CoNCISE Language for Small-Molecule Binding. Vol. 15647 LNBI, 2025, pp. 260–63. Scopus, doi:10.1007/978-3-031-90252-9_17.
Erden M, Devkota K, Varghese L, Cowen L, Singh R. Learning a CoNCISE Language for Small-Molecule Binding. 2025. p. 260–263.

DOI

Publication Date

January 1, 2025

Volume

15647 LNBI

Start / End Page

260 / 263

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences