Skip to main content
Journal cover image

Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review.

Publication ,  Journal Article
Cai, T; He, Z; Hong, C; Zhang, Y; Ho, Y-L; Honerlaw, J; Geva, A; Ayakulangara Panickan, V; King, A; Gagnon, DR; Gaziano, M; Cho, K; Liao, K; Cai, T
Published in: J Biomed Inform
August 2022

OBJECTIVE: Accurately assigning phenotype information to individual patients via computational phenotyping using Electronic Health Records (EHRs) has been seen as the first step towards enabling EHRs for precision medicine research. Chart review labels annotated by clinical experts, also known as "gold standard" labels, are essential for the development and validation of computational phenotyping algorithms. However, given the complexity of EHR systems, the process of chart review is both labor intensive and time consuming. We propose a fully automated algorithm, referred to as pGUESS, to rank EHR notes according to their relevance to a given phenotype. By identifying the most relevant notes, pGUESS can greatly improve the efficiency and accuracy of chart reviews. METHOD: pGUESS uses prior guided semantic similarity to measure the informativeness of a clinical note to a given phenotype. We first select candidate clinical concepts from a pool of comprehensive medical concepts using public knowledge sources and then derive the semantic embedding vector (SEV) for a reference article (SEVref) and each note (SEVnote). The algorithm scores the relevance of a note as the cosine similarity between SEVnote and SEVref. RESULTS: The algorithm was validated against four sets of 200 notes that were manually annotated by clinical experts to assess their informativeness to one of three disease phenotypes. pGUESS algorithm substantially outperforms existing unsupervised approaches for classifying the relevance status with respect to both accuracy and scalability across phenotypes. Averaging over the three phenotypes, the rank correlation between the algorithm ranking and gold standard label was 0.64 for pGUESS, but only 0.47 and 0.35 for the next two best performing algorithms. pGUESS is also much more computationally scalable compared to existing algorithms. CONCLUSION: pGUESS algorithm can substantially reduce the burden of chart review and holds potential in improving the efficiency and accuracy of human annotation.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

August 2022

Volume

132

Start / End Page

104109

Location

United States

Related Subject Headings

  • Semantics
  • Precision Medicine
  • Phenotype
  • Natural Language Processing
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Biomedical Engineering
  • Algorithms
  • 4601 Applied computing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cai, T., He, Z., Hong, C., Zhang, Y., Ho, Y.-L., Honerlaw, J., … Liao, K. (2022). Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review. J Biomed Inform, 132, 104109. https://doi.org/10.1016/j.jbi.2022.104109
Cai, Tianrun, Zeling He, Chuan Hong, Yichi Zhang, Yuk-Lam Ho, Jacqueline Honerlaw, Alon Geva, et al. “Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review.J Biomed Inform 132 (August 2022): 104109. https://doi.org/10.1016/j.jbi.2022.104109.
Cai T, He Z, Hong C, Zhang Y, Ho Y-L, Honerlaw J, et al. Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review. J Biomed Inform. 2022 Aug;132:104109.
Cai, Tianrun, et al. “Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review.J Biomed Inform, vol. 132, Aug. 2022, p. 104109. Pubmed, doi:10.1016/j.jbi.2022.104109.
Cai T, He Z, Hong C, Zhang Y, Ho Y-L, Honerlaw J, Geva A, Ayakulangara Panickan V, King A, Gagnon DR, Gaziano M, Cho K, Liao K. Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review. J Biomed Inform. 2022 Aug;132:104109.
Journal cover image

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

August 2022

Volume

132

Start / End Page

104109

Location

United States

Related Subject Headings

  • Semantics
  • Precision Medicine
  • Phenotype
  • Natural Language Processing
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Biomedical Engineering
  • Algorithms
  • 4601 Applied computing