Natural language processing for abstraction of cancer treatment toxicities: accuracy versus human experts.

Journal Article

OBJECTIVES: Expert abstraction of acute toxicities is critical in oncology research but is labor-intensive and variable. We assessed the accuracy of a natural language processing (NLP) pipeline in extracting symptoms from clinical notes, compared with abstraction by physicians. MATERIALS AND METHODS: Two independent reviewers identified present and negated National Cancer Institute Common Terminology Criteria for Adverse Events (CTCAE) v5.0 symptoms in 100 randomly selected notes from on-treatment visits during radiation therapy, with adjudication by a third reviewer. An NLP pipeline based on the Apache clinical Text Analysis and Knowledge Extraction System (cTAKES) was developed and used to extract CTCAE terms. Accuracy was assessed by precision, recall, and F1. RESULTS: The NLP pipeline demonstrated high accuracy for common physician-abstracted symptoms, such as radiation dermatitis (F1 0.88), fatigue (F1 0.85), and nausea (F1 0.88). NLP had poor sensitivity for negated symptoms. CONCLUSION: NLP accurately detects a subset of documented present CTCAE symptoms, though it is limited for negated symptoms. It may facilitate strategies to more consistently identify toxicities during cancer therapy.
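The abstract reports accuracy as precision, recall, and F1 per symptom. As a minimal sketch of how such per-symptom scores could be computed against an adjudicated reference, assuming each note is reduced to a set of symptom labels: the function names, the set-per-note representation, and the four toy notes below are all hypothetical illustrations, not the study's code or data.

```python
from typing import List, Set, Tuple


def count_outcomes(reference: List[Set[str]],
                   predicted: List[Set[str]],
                   symptom: str) -> Tuple[int, int, int]:
    """Count true positives, false positives, and false negatives
    for one symptom across paired notes (hypothetical helper)."""
    tp = fp = fn = 0
    for ref_labels, pred_labels in zip(reference, predicted):
        in_ref = symptom in ref_labels
        in_pred = symptom in pred_labels
        if in_ref and in_pred:
            tp += 1
        elif in_pred:
            fp += 1
        elif in_ref:
            fn += 1
    return tp, fp, fn


def precision_recall_f1(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Standard definitions; returns 0.0 where a denominator is zero."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Toy data only (NOT from the study): four notes, reference vs. NLP output.
reference_notes = [{"fatigue", "nausea"}, {"fatigue"}, set(), {"nausea"}]
nlp_notes = [{"fatigue"}, {"fatigue", "nausea"}, {"fatigue"}, {"nausea"}]

fatigue_scores = precision_recall_f1(
    *count_outcomes(reference_notes, nlp_notes, "fatigue"))
nausea_scores = precision_recall_f1(
    *count_outcomes(reference_notes, nlp_notes, "nausea"))
```

In this toy example, fatigue has two true positives and one false positive, so recall is perfect while precision is 2/3. Scoring negated symptoms would follow the same pattern with a separate label set for negated mentions.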

Cited Authors

  • Hong, JC; Fairchild, AT; Tanksley, JP; Palta, M; Tenenbaum, JD

Published Date

  • December 2020

Published In

  • JAMIA Open

Volume / Issue

  • 3 / 4

Start / End Page

  • 513 - 517

PubMed ID

  • 33623888

Pubmed Central ID

  • PMC7886534

Electronic International Standard Serial Number (EISSN)

  • 2574-2531

Digital Object Identifier (DOI)

  • 10.1093/jamiaopen/ooaa064

Language

  • eng

Conference Location

  • United States