Skip to main content
construction release_alert
The Scholars Team is working with OIT to resolve some issues with the Scholars search index
cancel

A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.

Publication ,  Journal Article
Shi, H; Book, WM; Ivey, LC; Rodriguez, FH; Raskind-Hood, C; Downing, KF; Farr, SL; McCracken, CE; Leedom, VO; Haynes, SE; Amouzou, S ...
Published in: Birth Defects Res
February 2025

BACKGROUND: International Classification of Diseases (ICD) codes utilized for congenital heart defect (CHD) case identification in datasets have substantial false-positive (FP) rates. Incorporating machine learning (ML) algorithms following case selection by ICD codes may improve the accuracy of CHD identification, enhancing surveillance efforts. METHODS: Traditional ML methods were applied to four encounter-level datasets, 2010-2019, for 3334 patients with validated diagnoses and with at least one CHD ICD code identified. A 5-fold cross-validation approach was applied to the dataset to determine the set of overlapping important features best classifying CHD cases. Training and testing combinations were explored to determine the approach yielding the most accurate CHD classification. RESULTS: CHD ICD positive predictive values (PPVs) by site ranged from 53.2% to 84.0%. The ML algorithm achieved a PPV of 95% (1273/1340) for the four-site dataset with a false-negative (FN) rate of 33% (639/1912) by choosing an operating point prioritizing PPV from the PPV-FN rate curve. XGBoost reduced 2105 Clinical Classification Software (CCS) features to 137 that identified those with true-positive (TP) CHD and false-positive FP classification. CONCLUSION: Applying ML algorithms following case selection by CHD-related ICD codes improved the accuracy of identifying TP true-positive CHD cases.

Duke Scholars

Published In

Birth Defects Res

DOI

EISSN

2472-1727

Publication Date

February 2025

Volume

117

Issue

2

Start / End Page

e2440

Location

United States

Related Subject Headings

  • Male
  • Machine Learning
  • International Classification of Diseases
  • Infant, Newborn
  • Infant
  • Humans
  • Heart Defects, Congenital
  • Female
  • Algorithms
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Shi, H., Book, W. M., Ivey, L. C., Rodriguez, F. H., Raskind-Hood, C., Downing, K. F., … Kamaleswaran, R. (2025). A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res, 117(2), e2440. https://doi.org/10.1002/bdr2.2440
Shi, Haoming, Wendy M. Book, Lindsey C. Ivey, Fred H. Rodriguez, Cheryl Raskind-Hood, Karrie F. Downing, Sherry L. Farr, et al. “A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.Birth Defects Res 117, no. 2 (February 2025): e2440. https://doi.org/10.1002/bdr2.2440.
Shi H, Book WM, Ivey LC, Rodriguez FH, Raskind-Hood C, Downing KF, et al. A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res. 2025 Feb;117(2):e2440.
Shi, Haoming, et al. “A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.Birth Defects Res, vol. 117, no. 2, Feb. 2025, p. e2440. Pubmed, doi:10.1002/bdr2.2440.
Shi H, Book WM, Ivey LC, Rodriguez FH, Raskind-Hood C, Downing KF, Farr SL, McCracken CE, Leedom VO, Haynes SE, Amouzou S, Sameni R, Kamaleswaran R. A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res. 2025 Feb;117(2):e2440.

Published In

Birth Defects Res

DOI

EISSN

2472-1727

Publication Date

February 2025

Volume

117

Issue

2

Start / End Page

e2440

Location

United States

Related Subject Headings

  • Male
  • Machine Learning
  • International Classification of Diseases
  • Infant, Newborn
  • Infant
  • Humans
  • Heart Defects, Congenital
  • Female
  • Algorithms