Skip to main content
Journal cover image

Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data.

Publication ,  Journal Article
Tedeschi, SK; Cai, T; He, Z; Ahuja, Y; Hong, C; Yates, KA; Dahal, K; Xu, C; Lyu, H; Yoshida, K; Solomon, DH; Cai, T; Liao, KP
Published in: Arthritis Care Res (Hoboken)
March 2021

OBJECTIVE: Identifying pseudogout in large data sets is difficult due to its episodic nature and a lack of billing codes specific to this acute subtype of calcium pyrophosphate (CPP) deposition disease. The objective of this study was to evaluate a novel machine learning approach for classifying pseudogout using electronic health record (EHR) data. METHODS: We created an EHR data mart of patients with ≥1 relevant billing code or ≥2 natural language processing (NLP) mentions of pseudogout or chondrocalcinosis, 1991-2017. We selected 900 subjects for gold standard chart review for definite pseudogout (synovitis + synovial fluid CPP crystals), probable pseudogout (synovitis + chondrocalcinosis), or not pseudogout. We applied a topic modeling approach to identify definite/probable pseudogout. A combined algorithm included topic modeling plus manually reviewed CPP crystal results. We compared algorithm performance and cohorts identified by billing codes, the presence of CPP crystals, topic modeling, and a combined algorithm. RESULTS: Among 900 subjects, 123 (13.7%) had pseudogout by chart review (68 definite, 55 probable). Billing codes had a sensitivity of 65% and a positive predictive value (PPV) of 22% for pseudogout. The presence of CPP crystals had a sensitivity of 29% and a PPV of 92%. Without using CPP crystal results, topic modeling had a sensitivity of 29% and a PPV of 79%. The combined algorithm yielded a sensitivity of 42% and a PPV of 81%. The combined algorithm identified 50% more patients than the presence of CPP crystals; the latter captured a portion of definite pseudogout and missed probable pseudogout. CONCLUSION: For pseudogout, an episodic disease with no specific billing code, combining NLP, machine learning methods, and synovial fluid laboratory results yielded an algorithm that significantly boosted the PPV compared to billing codes.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Arthritis Care Res (Hoboken)

DOI

EISSN

2151-4658

Publication Date

March 2021

Volume

73

Issue

3

Start / End Page

442 / 448

Location

United States

Related Subject Headings

  • Natural Language Processing
  • Middle Aged
  • Male
  • Machine Learning
  • Humans
  • Female
  • Electronic Health Records
  • Data Mining
  • Chondrocalcinosis
  • Aged, 80 and over
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Tedeschi, S. K., Cai, T., He, Z., Ahuja, Y., Hong, C., Yates, K. A., … Liao, K. P. (2021). Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data. Arthritis Care Res (Hoboken), 73(3), 442–448. https://doi.org/10.1002/acr.24132
Tedeschi, Sara K., Tianrun Cai, Zeling He, Yuri Ahuja, Chuan Hong, Katherine A. Yates, Kumar Dahal, et al. “Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data.Arthritis Care Res (Hoboken) 73, no. 3 (March 2021): 442–48. https://doi.org/10.1002/acr.24132.
Tedeschi SK, Cai T, He Z, Ahuja Y, Hong C, Yates KA, et al. Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data. Arthritis Care Res (Hoboken). 2021 Mar;73(3):442–8.
Tedeschi, Sara K., et al. “Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data.Arthritis Care Res (Hoboken), vol. 73, no. 3, Mar. 2021, pp. 442–48. Pubmed, doi:10.1002/acr.24132.
Tedeschi SK, Cai T, He Z, Ahuja Y, Hong C, Yates KA, Dahal K, Xu C, Lyu H, Yoshida K, Solomon DH, Liao KP. Classifying Pseudogout Using Machine Learning Approaches With Electronic Health Record Data. Arthritis Care Res (Hoboken). 2021 Mar;73(3):442–448.
Journal cover image

Published In

Arthritis Care Res (Hoboken)

DOI

EISSN

2151-4658

Publication Date

March 2021

Volume

73

Issue

3

Start / End Page

442 / 448

Location

United States

Related Subject Headings

  • Natural Language Processing
  • Middle Aged
  • Male
  • Machine Learning
  • Humans
  • Female
  • Electronic Health Records
  • Data Mining
  • Chondrocalcinosis
  • Aged, 80 and over