Skip to main content

Multimodal representation learning for predicting molecule-disease relations.

Publication ,  Journal Article
Wen, J; Zhang, X; Rush, E; Panickan, VA; Li, X; Cai, T; Zhou, D; Ho, Y-L; Costa, L; Begoli, E; Hong, C; Gaziano, JM; Cho, K; Lu, J; Cai, T ...
Published in: Bioinformatics
February 3, 2023

MOTIVATION: Predicting molecule-disease indications and side effects is important for drug development and pharmacovigilance. Comprehensively mining molecule-molecule, molecule-disease and disease-disease semantic dependencies can potentially improve prediction performance. METHODS: We introduce a Multi-Modal REpresentation Mapping Approach to Predicting molecular-disease relations (M2REMAP) by incorporating clinical semantics learned from electronic health records (EHR) of 12.6 million patients. Specifically, M2REMAP first learns a multimodal molecule representation that synthesizes chemical property and clinical semantic information by mapping molecule chemicals via a deep neural network onto the clinical semantic embedding space shared by drugs, diseases and other common clinical concepts. To infer molecule-disease relations, M2REMAP combines multimodal molecule representation and disease semantic embedding to jointly infer indications and side effects. RESULTS: We extensively evaluate M2REMAP on molecule indications, side effects and interactions. Results show that incorporating EHR embeddings improves performance significantly, for example, attaining an improvement over the baseline models by 23.6% in PRC-AUC on indications and 23.9% on side effects. Further, M2REMAP overcomes the limitation of existing methods and effectively predicts drugs for novel diseases and emerging pathogens. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/celehs/M2REMAP, and prediction results are provided at https://shiny.parse-health.org/drugs-diseases-dev/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

February 3, 2023

Volume

39

Issue

2

Location

England

Related Subject Headings

  • Pharmacovigilance
  • Neural Networks, Computer
  • Humans
  • Electronic Health Records
  • Drug-Related Side Effects and Adverse Reactions
  • Drug Development
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wen, J., Zhang, X., Rush, E., Panickan, V. A., Li, X., Cai, T., … Zitnik, M. (2023). Multimodal representation learning for predicting molecule-disease relations. Bioinformatics, 39(2). https://doi.org/10.1093/bioinformatics/btad085
Wen, Jun, Xiang Zhang, Everett Rush, Vidul A. Panickan, Xingyu Li, Tianrun Cai, Doudou Zhou, et al. “Multimodal representation learning for predicting molecule-disease relations.Bioinformatics 39, no. 2 (February 3, 2023). https://doi.org/10.1093/bioinformatics/btad085.
Wen J, Zhang X, Rush E, Panickan VA, Li X, Cai T, et al. Multimodal representation learning for predicting molecule-disease relations. Bioinformatics. 2023 Feb 3;39(2).
Wen, Jun, et al. “Multimodal representation learning for predicting molecule-disease relations.Bioinformatics, vol. 39, no. 2, Feb. 2023. Pubmed, doi:10.1093/bioinformatics/btad085.
Wen J, Zhang X, Rush E, Panickan VA, Li X, Cai T, Zhou D, Ho Y-L, Costa L, Begoli E, Hong C, Gaziano JM, Cho K, Lu J, Liao KP, Zitnik M. Multimodal representation learning for predicting molecule-disease relations. Bioinformatics. 2023 Feb 3;39(2).

Published In

Bioinformatics

DOI

EISSN

1367-4811

Publication Date

February 3, 2023

Volume

39

Issue

2

Location

England

Related Subject Headings

  • Pharmacovigilance
  • Neural Networks, Computer
  • Humans
  • Electronic Health Records
  • Drug-Related Side Effects and Adverse Reactions
  • Drug Development
  • Bioinformatics
  • 49 Mathematical sciences
  • 46 Information and computing sciences
  • 31 Biological sciences