Skip to main content

Breast Cancer Risk Prediction Using Electronic Health Records

Publication ,  Conference
Wu, Y; Burnside, ES; Cox, J; Fan, J; Yuan, M; Yin, J; Peissig, P; Cobian, A; Page, D; Craven, M
Published in: Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017
September 8, 2017

Electronic health records (EHRs) represent an underused data source that has great research and clinical potential. Our goal was to quantify the value of EHRs in breast cancer risk prediction. We conducted a retrospective case-control study, gathering patients' ICD-9 diagnosis codes from an existing EHR data repository. Based on the hierarchical structure of ICD-9 codes, which are composed of 3-5 digits, three levels of data representation were studied: level 0, using only the first 3 digits; level 1, using up to the first 4 digits; and level 2, using up to the full 5 digits of each code. We created two models to predict breast cancer one year in advance based on diagnosis codes in three levels of data representation: logistic regression (LR) and LASSO logistic regression (LR+Lasso). Area under the ROC curve (AUC) was used to assess model performance. The LR+Lasso model demonstrated significantly higher predictive performance than the LR model when using the level 2 feature representation (0.648 vs 0.603, p=0.013). For both the level 1 representation and the level 0 representation, the predictive difference between LR+Lasso and LR model was not significant, (0.634 vs 0.604, p=0.081) and (0.612 vs 0.603, p=0.523), respectively. For LR model, predictive performance changed modestly across three levels. For LR+Lasso model, predictive performance also changed modestly from the level 0 to the level 1representation (p=0.168) and from the level 1 to the level 2 representation (p=0.374). However, the level 2 representation provided significantly higher predictive performance than the level 0 representation (p=0.034). The unabridged level 2 representation of the diagnosis codes contains the most valuable information that may contribute to breast cancer risk prediction. The performance of these models demonstrates that EHR data can be used to predict breast cancer risk, which provides the possibility to personalize care in clinical practice. In the future, we will combine coded EHR data with demographic risk factors, genetic variants, and imaging features to improve breast cancer risk prediction.

Duke Scholars

Published In

Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017

DOI

Publication Date

September 8, 2017

Start / End Page

224 / 228
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wu, Y., Burnside, E. S., Cox, J., Fan, J., Yuan, M., Yin, J., … Craven, M. (2017). Breast Cancer Risk Prediction Using Electronic Health Records. In Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017 (pp. 224–228). https://doi.org/10.1109/ICHI.2017.62
Wu, Y., E. S. Burnside, J. Cox, J. Fan, M. Yuan, J. Yin, P. Peissig, A. Cobian, D. Page, and M. Craven. “Breast Cancer Risk Prediction Using Electronic Health Records.” In Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017, 224–28, 2017. https://doi.org/10.1109/ICHI.2017.62.
Wu Y, Burnside ES, Cox J, Fan J, Yuan M, Yin J, et al. Breast Cancer Risk Prediction Using Electronic Health Records. In: Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017. 2017. p. 224–8.
Wu, Y., et al. “Breast Cancer Risk Prediction Using Electronic Health Records.” Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017, 2017, pp. 224–28. Scopus, doi:10.1109/ICHI.2017.62.
Wu Y, Burnside ES, Cox J, Fan J, Yuan M, Yin J, Peissig P, Cobian A, Page D, Craven M. Breast Cancer Risk Prediction Using Electronic Health Records. Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017. 2017. p. 224–228.

Published In

Proceedings - 2017 IEEE International Conference on Healthcare Informatics, ICHI 2017

DOI

Publication Date

September 8, 2017

Start / End Page

224 / 228