Scholars@Duke publication: A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.

A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.

Publication , Journal Article

Shi, H; Book, WM; Ivey, LC; Rodriguez, FH; Raskind-Hood, C; Downing, KF; Farr, SL; McCracken, CE; Leedom, VO; Haynes, SE; Amouzou, S ...

Published in: Birth Defects Res

February 2025

Published version (DOI) Link to item

BACKGROUND: International Classification of Diseases (ICD) codes utilized for congenital heart defect (CHD) case identification in datasets have substantial false-positive (FP) rates. Incorporating machine learning (ML) algorithms following case selection by ICD codes may improve the accuracy of CHD identification, enhancing surveillance efforts. METHODS: Traditional ML methods were applied to four encounter-level datasets, 2010-2019, for 3334 patients with validated diagnoses and with at least one CHD ICD code identified. A 5-fold cross-validation approach was applied to the dataset to determine the set of overlapping important features best classifying CHD cases. Training and testing combinations were explored to determine the approach yielding the most accurate CHD classification. RESULTS: CHD ICD positive predictive values (PPVs) by site ranged from 53.2% to 84.0%. The ML algorithm achieved a PPV of 95% (1273/1340) for the four-site dataset with a false-negative (FN) rate of 33% (639/1912) by choosing an operating point prioritizing PPV from the PPV-FN rate curve. XGBoost reduced 2105 Clinical Classification Software (CCS) features to 137 that identified those with true-positive (TP) CHD and false-positive FP classification. CONCLUSION: Applying ML algorithms following case selection by CHD-related ICD codes improved the accuracy of identifying TP true-positive CHD cases.

Duke Scholars

Author Rishi Kamaleswaran Trauma, Acute, and Critical Care Surgery

Published In

Birth Defects Res

DOI

10.1002/bdr2.2440

EISSN

2472-1727

Publication Date

February 2025

Volume

117

Issue

Start / End Page

e2440

Location

United States

Related Subject Headings

Male
Machine Learning
International Classification of Diseases
Humans
Heart Defects, Congenital
Female
False Positive Reactions
Algorithms

Citation

APA

Chicago

ICMJE

MLA

NLM

Shi, H., Book, W. M., Ivey, L. C., Rodriguez, F. H., Raskind-Hood, C., Downing, K. F., … Kamaleswaran, R. (2025). A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res, 117(2), e2440. https://doi.org/10.1002/bdr2.2440

Shi, Haoming, Wendy M. Book, Lindsey C. Ivey, Fred H. Rodriguez, Cheryl Raskind-Hood, Karrie F. Downing, Sherry L. Farr, et al. “A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.” Birth Defects Res 117, no. 2 (February 2025): e2440. https://doi.org/10.1002/bdr2.2440.

Shi H, Book WM, Ivey LC, Rodriguez FH, Raskind-Hood C, Downing KF, et al. A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res. 2025 Feb;117(2):e2440.

Shi, Haoming, et al. “A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes.” Birth Defects Res, vol. 117, no. 2, Feb. 2025, p. e2440. Pubmed, doi:10.1002/bdr2.2440.

Shi H, Book WM, Ivey LC, Rodriguez FH, Raskind-Hood C, Downing KF, Farr SL, McCracken CE, Leedom VO, Haynes SE, Amouzou S, Sameni R, Kamaleswaran R. A Generalized Machine Learning Model for Identifying Congenital Heart Defects (CHDs) Using ICD Codes. Birth Defects Res. 2025 Feb;117(2):e2440.

Published In

Birth Defects Res

DOI

10.1002/bdr2.2440

EISSN

2472-1727

Publication Date

February 2025

Volume

117

Issue

Start / End Page

e2440

Location

United States

Related Subject Headings

Male
Machine Learning
International Classification of Diseases
Humans
Heart Defects, Congenital
Female
False Positive Reactions
Algorithms