Skip to main content

Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning.

Publication ,  Journal Article
Yang, R; Zhu, D; Howard, LE; De Hoedt, A; Williams, SB; Freedland, SJ; Klaassen, Z
Published in: JCO clinical cancer informatics
October 2022

Understanding treatment patterns and effectiveness for patients with metastatic prostate cancer (mPCa) is dependent on accurate assessment of metastatic status. The objective was to develop a natural language processing (NLP) model for identifying patients with mPCa and evaluate the model's performance against chart-reviewed data and an International Classification of Diseases (ICD) 9/10 code-based method.In total, 139,057 radiology reports on 6,211 unique patients from the Department of Veterans Affairs were used. The gold standard was metastases by detailed chart review of radiology reports. NLP performance was assessed by sensitivity, specificity, positive predictive value, negative predictive value, and date of metastases detection. Receiver operating characteristic curves was used to assess model performance.When compared with chart review, the NLP model had high sensitivity and specificity (85% and 96%, respectively). The NLP model was able to predict patient-level metastasis status with a sensitivity of 91% and specificity of 81%, whereas sensitivity and specificity using ICD9/10 billing codes were 73% and 86%, respectively. For the NLP model, date of metastases detection was exactly concordant and within < 1 week in 55% and 58% of patients, compared with 8% and 17%, respectively, using the ICD9/10 billing codes method. The area under the curve for the NLP model was 0.911. A limitation is the NLP model was developed on the basis of a subset of patients with mPCa and may not be generalizable to all patients with mPCa.This population-level NLP model for identifying patients with mPCa was more accurate than using ICD9/10 billing codes when compared with chart-reviewed data. Upon further validation, this model may allow for efficient population-level identification of patients with mPCa.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

JCO clinical cancer informatics

DOI

EISSN

2473-4276

ISSN

2473-4276

Publication Date

October 2022

Volume

6

Start / End Page

e2100071

Related Subject Headings

  • Prostatic Neoplasms
  • Natural Language Processing
  • Male
  • Machine Learning
  • Humans
  • Electronic Health Records
  • Algorithms
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yang, R., Zhu, D., Howard, L. E., De Hoedt, A., Williams, S. B., Freedland, S. J., & Klaassen, Z. (2022). Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning. JCO Clinical Cancer Informatics, 6, e2100071. https://doi.org/10.1200/cci.21.00071
Yang, Ruixin, Di Zhu, Lauren E. Howard, Amanda De Hoedt, Stephen B. Williams, Stephen J. Freedland, and Zachary Klaassen. “Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning.JCO Clinical Cancer Informatics 6 (October 2022): e2100071. https://doi.org/10.1200/cci.21.00071.
Yang R, Zhu D, Howard LE, De Hoedt A, Williams SB, Freedland SJ, et al. Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning. JCO clinical cancer informatics. 2022 Oct;6:e2100071.
Yang, Ruixin, et al. “Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning.JCO Clinical Cancer Informatics, vol. 6, Oct. 2022, p. e2100071. Epmc, doi:10.1200/cci.21.00071.
Yang R, Zhu D, Howard LE, De Hoedt A, Williams SB, Freedland SJ, Klaassen Z. Identification of Patients With Metastatic Prostate Cancer With Natural Language Processing and Machine Learning. JCO clinical cancer informatics. 2022 Oct;6:e2100071.

Published In

JCO clinical cancer informatics

DOI

EISSN

2473-4276

ISSN

2473-4276

Publication Date

October 2022

Volume

6

Start / End Page

e2100071

Related Subject Headings

  • Prostatic Neoplasms
  • Natural Language Processing
  • Male
  • Machine Learning
  • Humans
  • Electronic Health Records
  • Algorithms