Skip to main content

Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.

Publication ,  Journal Article
Hassanpour, S; Langlotz, CP; Amrhein, TJ; Befera, NT; Lungren, MP
Published in: AJR Am J Roentgenol
April 2017

OBJECTIVE: The purpose of this study is to evaluate the performance of a natural language processing (NLP) system in classifying a database of free-text knee MRI reports at two separate academic radiology practices. MATERIALS AND METHODS: An NLP system that uses terms and patterns in manually classified narrative knee MRI reports was constructed. The NLP system was trained and tested on expert-classified knee MRI reports from two major health care organizations. Radiology reports were modeled in the training set as vectors, and a support vector machine framework was used to train the classifier. A separate test set from each organization was used to evaluate the performance of the system. We evaluated the performance of the system both within and across organizations. Standard evaluation metrics, such as accuracy, precision, recall, and F1 score (i.e., the weighted average of the precision and recall), and their respective 95% CIs were used to measure the efficacy of our classification system. RESULTS: The accuracy for radiology reports that belonged to the model's clinically significant concept classes after training data from the same institution was good, yielding an F1 score greater than 90% (95% CI, 84.6-97.3%). Performance of the classifier on cross-institutional application without institution-specific training data yielded F1 scores of 77.6% (95% CI, 69.5-85.7%) and 90.2% (95% CI, 84.5-95.9%) at the two organizations studied. CONCLUSION: The results show excellent accuracy by the NLP machine learning classifier in classifying free-text knee MRI reports, supporting the institution-independent reproducibility of knee MRI report classification. Furthermore, the machine learning classifier performed well on free-text knee MRI reports from another institution. These data support the feasibility of multiinstitutional classification of radiologic imaging text reports with a single machine learning classifier without requiring institution-specific training data.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

AJR Am J Roentgenol

DOI

EISSN

1546-3141

Publication Date

April 2017

Volume

208

Issue

4

Start / End Page

750 / 753

Location

United States

Related Subject Headings

  • Workload
  • Support Vector Machine
  • Sensitivity and Specificity
  • Reproducibility of Results
  • Radiology Information Systems
  • Radiology Department, Hospital
  • Pattern Recognition, Automated
  • Nuclear Medicine & Medical Imaging
  • North Carolina
  • Natural Language Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Hassanpour, S., Langlotz, C. P., Amrhein, T. J., Befera, N. T., & Lungren, M. P. (2017). Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield. AJR Am J Roentgenol, 208(4), 750–753. https://doi.org/10.2214/AJR.16.16128
Hassanpour, Saeed, Curtis P. Langlotz, Timothy J. Amrhein, Nicholas T. Befera, and Matthew P. Lungren. “Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.AJR Am J Roentgenol 208, no. 4 (April 2017): 750–53. https://doi.org/10.2214/AJR.16.16128.
Hassanpour S, Langlotz CP, Amrhein TJ, Befera NT, Lungren MP. Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield. AJR Am J Roentgenol. 2017 Apr;208(4):750–3.
Hassanpour, Saeed, et al. “Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield.AJR Am J Roentgenol, vol. 208, no. 4, Apr. 2017, pp. 750–53. Pubmed, doi:10.2214/AJR.16.16128.
Hassanpour S, Langlotz CP, Amrhein TJ, Befera NT, Lungren MP. Performance of a Machine Learning Classifier of Knee MRI Reports in Two Large Academic Radiology Practices: A Tool to Estimate Diagnostic Yield. AJR Am J Roentgenol. 2017 Apr;208(4):750–753.

Published In

AJR Am J Roentgenol

DOI

EISSN

1546-3141

Publication Date

April 2017

Volume

208

Issue

4

Start / End Page

750 / 753

Location

United States

Related Subject Headings

  • Workload
  • Support Vector Machine
  • Sensitivity and Specificity
  • Reproducibility of Results
  • Radiology Information Systems
  • Radiology Department, Hospital
  • Pattern Recognition, Automated
  • Nuclear Medicine & Medical Imaging
  • North Carolina
  • Natural Language Processing