Skip to main content
Journal cover image

Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes.

Publication ,  Journal Article
Sajdeya, R; Mardini, MT; Tighe, PJ; Ison, RL; Bai, C; Jugl, S; Hanzhi, G; Zandbiglari, K; Adiba, FI; Winterstein, AG; Pearson, TA; Cook, RL ...
Published in: J Am Med Inform Assoc
July 19, 2023

OBJECTIVE: This study aimed to develop a natural language processing algorithm (NLP) using machine learning (ML) techniques to identify and classify documentation of preoperative cannabis use status. MATERIALS AND METHODS: We developed and applied a keyword search strategy to identify documentation of preoperative cannabis use status in clinical documentation within 60 days of surgery. We manually reviewed matching notes to classify each documentation into 8 different categories based on context, time, and certainty of cannabis use documentation. We applied 2 conventional ML and 3 deep learning models against manual annotation. We externally validated our model using the MIMIC-III dataset. RESULTS: The tested classifiers achieved classification results close to human performance with up to 93% and 94% precision and 95% recall of preoperative cannabis use status documentation. External validation showed consistent results with up to 94% precision and recall. DISCUSSION: Our NLP model successfully replicated human annotation of preoperative cannabis use documentation, providing a baseline framework for identifying and classifying documentation of cannabis use. We add to NLP methods applied in healthcare for clinical concept extraction and classification, mainly concerning social determinants of health and substance use. Our systematically developed lexicon provides a comprehensive knowledge-based resource covering a wide range of cannabis-related concepts for future NLP applications. CONCLUSION: We demonstrated that documentation of preoperative cannabis use status could be accurately identified using an NLP algorithm. This approach can be employed to identify comparison groups based on cannabis exposure for growing research efforts aiming to guide cannabis-related clinical practices and policies.

Duke Scholars

Published In

J Am Med Inform Assoc

DOI

EISSN

1527-974X

Publication Date

July 19, 2023

Volume

30

Issue

8

Start / End Page

1418 / 1428

Location

England

Related Subject Headings

  • Natural Language Processing
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Documentation
  • Cannabis
  • Algorithms
  • 46 Information and computing sciences
  • 42 Health sciences
  • 32 Biomedical and clinical sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sajdeya, R., Mardini, M. T., Tighe, P. J., Ison, R. L., Bai, C., Jugl, S., … Rouhizadeh, M. (2023). Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes. J Am Med Inform Assoc, 30(8), 1418–1428. https://doi.org/10.1093/jamia/ocad080
Sajdeya, Ruba, Mamoun T. Mardini, Patrick J. Tighe, Ronald L. Ison, Chen Bai, Sebastian Jugl, Gao Hanzhi, et al. “Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes.J Am Med Inform Assoc 30, no. 8 (July 19, 2023): 1418–28. https://doi.org/10.1093/jamia/ocad080.
Sajdeya R, Mardini MT, Tighe PJ, Ison RL, Bai C, Jugl S, et al. Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes. J Am Med Inform Assoc. 2023 Jul 19;30(8):1418–28.
Sajdeya, Ruba, et al. “Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes.J Am Med Inform Assoc, vol. 30, no. 8, July 2023, pp. 1418–28. Pubmed, doi:10.1093/jamia/ocad080.
Sajdeya R, Mardini MT, Tighe PJ, Ison RL, Bai C, Jugl S, Hanzhi G, Zandbiglari K, Adiba FI, Winterstein AG, Pearson TA, Cook RL, Rouhizadeh M. Developing and validating a natural language processing algorithm to extract preoperative cannabis use status documentation from unstructured narrative clinical notes. J Am Med Inform Assoc. 2023 Jul 19;30(8):1418–1428.
Journal cover image

Published In

J Am Med Inform Assoc

DOI

EISSN

1527-974X

Publication Date

July 19, 2023

Volume

30

Issue

8

Start / End Page

1418 / 1428

Location

England

Related Subject Headings

  • Natural Language Processing
  • Medical Informatics
  • Humans
  • Electronic Health Records
  • Documentation
  • Cannabis
  • Algorithms
  • 46 Information and computing sciences
  • 42 Health sciences
  • 32 Biomedical and clinical sciences