Skip to main content

Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing.

Publication ,  Journal Article
Yang, R; Zhu, D; Howard, LE; De Hoedt, A; Schroeck, FR; Klaassen, Z; Freedland, SJ; Williams, SB
Published in: JCO clinical cancer informatics
January 2022

Mortality from bladder cancer (BC) increases exponentially once it invades the muscle, with inherent challenges delineating at the population level. We sought to develop and validate a natural language processing (NLP) model for automatically identifying patients with muscle-invasive bladder cancer (MIBC).All patients with a Current Procedural Terminology code for transurethral resection of bladder tumor (TURBT; n = 76,060) were selected from the Department of Veterans Affairs (VA) database. A sample of 600 patients (with 2,337 full-text notes) who had TURBT and confirmed pathology results were selected for NLP model development and validation. The NLP performance was assessed by calculating the sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and overall accuracy at the individual note and patient levels.In the validation cohort, the NLP model had average overall accuracies of 94% and 96% at the note and patient levels. Specifically, the F1 score and overall accuracy for predicting muscle invasion at the patient level were 0.87% and 96%, respectively. The model classified nonmuscle-invasive bladder cancer (NMIBC) with overall accuracies of 90% and 93% at the note and patient levels. When applying the model to 71,200 patients VA-wide, the model classified 13,642 (19%) as having MIBC and 47,595 (66%) as NMIBC and was able to identify invasion status for 96% of patients with TURBT at the population level. Inherent limitations include a relatively small training set, given the size of the VA population.This NLP model, with high accuracy, may be a practical tool for efficiently identifying BC invasion status and aid in population-based BC research.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

JCO clinical cancer informatics

DOI

EISSN

2473-4276

ISSN

2473-4276

Publication Date

January 2022

Volume

6

Start / End Page

e2100097

Related Subject Headings

  • Urologic Surgical Procedures
  • Urinary Bladder Neoplasms
  • Rare Diseases
  • Natural Language Processing
  • Muscles
  • Male
  • Humans
  • Female
  • Cystectomy
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yang, R., Zhu, D., Howard, L. E., De Hoedt, A., Schroeck, F. R., Klaassen, Z., … Williams, S. B. (2022). Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing. JCO Clinical Cancer Informatics, 6, e2100097. https://doi.org/10.1200/cci.21.00097
Yang, Ruixin, Di Zhu, Lauren E. Howard, Amanda De Hoedt, Florian R. Schroeck, Zachary Klaassen, Stephen J. Freedland, and Stephen B. Williams. “Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing.JCO Clinical Cancer Informatics 6 (January 2022): e2100097. https://doi.org/10.1200/cci.21.00097.
Yang R, Zhu D, Howard LE, De Hoedt A, Schroeck FR, Klaassen Z, et al. Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing. JCO clinical cancer informatics. 2022 Jan;6:e2100097.
Yang, Ruixin, et al. “Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing.JCO Clinical Cancer Informatics, vol. 6, Jan. 2022, p. e2100097. Epmc, doi:10.1200/cci.21.00097.
Yang R, Zhu D, Howard LE, De Hoedt A, Schroeck FR, Klaassen Z, Freedland SJ, Williams SB. Context-Based Identification of Muscle Invasion Status in Patients With Bladder Cancer Using Natural Language Processing. JCO clinical cancer informatics. 2022 Jan;6:e2100097.

Published In

JCO clinical cancer informatics

DOI

EISSN

2473-4276

ISSN

2473-4276

Publication Date

January 2022

Volume

6

Start / End Page

e2100097

Related Subject Headings

  • Urologic Surgical Procedures
  • Urinary Bladder Neoplasms
  • Rare Diseases
  • Natural Language Processing
  • Muscles
  • Male
  • Humans
  • Female
  • Cystectomy