Skip to main content
Journal cover image

DOME: Directional medical embedding vectors from Electronic Health Records.

Publication ,  Journal Article
Wen, J; Xue, H; Rush, E; Panickan, VA; Cai, T; Zhou, D; Ho, Y-L; Costa, L; Begoli, E; Hong, C; Gaziano, JM; Cho, K; Liao, KP; Lu, J; Cai, T
Published in: J Biomed Inform
February 2025

MOTIVATION: The increasing availability of Electronic Health Record (EHR) systems has created enormous potential for translational research. Recent developments in representation learning techniques have led to effective large-scale representations of EHR concepts along with knowledge graphs that empower downstream EHR studies. However, most existing methods require training with patient-level data, limiting their abilities to expand the training with multi-institutional EHR data. On the other hand, scalable approaches that only require summary-level data do not incorporate temporal dependencies between concepts. METHODS: We introduce a DirectiOnal Medical Embedding (DOME) algorithm to encode temporally directional relationships between medical concepts, using summary-level EHR data. Specifically, DOME first aggregates patient-level EHR data into an asymmetric co-occurrence matrix. Then it computes two Positive Pointwise Mutual Information (PPMI) matrices to correspondingly encode the pairwise prior and posterior dependencies between medical concepts. Following that, a joint matrix factorization is performed on the two PPMI matrices, which results in three vectors for each concept: a semantic embedding and two directional context embeddings. They collectively provide a comprehensive depiction of the temporal relationship between EHR concepts. RESULTS: We highlight the advantages and translational potential of DOME through three sets of validation studies. First, DOME consistently improves existing direction-agnostic embedding vectors for disease risk prediction in several diseases, for example achieving a relative gain of 5.5% in the area under the receiver operating characteristic (AUROC) for lung cancer. Second, DOME excels in directional drug-disease relationship inference by successfully differentiating between drug side effects and indications, correspondingly achieving relative AUROC gain over the state-of-the-art methods by 10.8% and 6.6%. Finally, DOME effectively constructs directional knowledge graphs, which distinguish disease risk factors from comorbidities, thereby revealing disease progression trajectories. The source codes are provided at https://github.com/celehs/Directional-EHR-embedding.

Duke Scholars

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

February 2025

Volume

162

Start / End Page

104768

Location

United States

Related Subject Headings

  • Medical Informatics
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Biomedical Engineering
  • Algorithms
  • 4601 Applied computing
  • 4203 Health services and systems
  • 11 Medical and Health Sciences
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wen, J., Xue, H., Rush, E., Panickan, V. A., Cai, T., Zhou, D., … Lu, J. (2025). DOME: Directional medical embedding vectors from Electronic Health Records. J Biomed Inform, 162, 104768. https://doi.org/10.1016/j.jbi.2024.104768
Wen, Jun, Hao Xue, Everett Rush, Vidul A. Panickan, Tianrun Cai, Doudou Zhou, Yuk-Lam Ho, et al. “DOME: Directional medical embedding vectors from Electronic Health Records.J Biomed Inform 162 (February 2025): 104768. https://doi.org/10.1016/j.jbi.2024.104768.
Wen J, Xue H, Rush E, Panickan VA, Cai T, Zhou D, et al. DOME: Directional medical embedding vectors from Electronic Health Records. J Biomed Inform. 2025 Feb;162:104768.
Wen, Jun, et al. “DOME: Directional medical embedding vectors from Electronic Health Records.J Biomed Inform, vol. 162, Feb. 2025, p. 104768. Pubmed, doi:10.1016/j.jbi.2024.104768.
Wen J, Xue H, Rush E, Panickan VA, Cai T, Zhou D, Ho Y-L, Costa L, Begoli E, Hong C, Gaziano JM, Cho K, Liao KP, Lu J. DOME: Directional medical embedding vectors from Electronic Health Records. J Biomed Inform. 2025 Feb;162:104768.
Journal cover image

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

February 2025

Volume

162

Start / End Page

104768

Location

United States

Related Subject Headings

  • Medical Informatics
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Biomedical Engineering
  • Algorithms
  • 4601 Applied computing
  • 4203 Health services and systems
  • 11 Medical and Health Sciences
  • 08 Information and Computing Sciences