Skip to main content
Journal cover image

A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis.

Publication ,  Journal Article
Huang, WA; Engelhard, M; Coffman, M; Hill, ED; Weng, Q; Scheer, A; Maslow, G; Henao, R; Dawson, G; Goldstein, BA
Published in: J Biomed Inform
September 2024

OBJECTIVE: This study aimed to develop a novel approach using routinely collected electronic health records (EHRs) data to improve the prediction of a rare event. We illustrated this using an example of improving early prediction of an autism diagnosis, given its low prevalence, by leveraging correlations between autism and other neurodevelopmental conditions (NDCs). METHODS: To achieve this, we introduced a conditional multi-label model by merging conditional learning and multi-label methodologies. The conditional learning approach breaks a hard task into more manageable pieces in each stage, and the multi-label approach utilizes information from related neurodevelopmental conditions to learn predictive latent features. The study involved forecasting autism diagnosis by age 5.5 years, utilizing data from the first 18 months of life, and the analysis of feature importance correlations to explore the alignment within the feature space across different conditions. RESULTS: Upon analysis of health records from 18,156 children, we are able to generate a model that predicts a future autism diagnosis with moderate performance (AUROC=0.76). The proposed conditional multi-label method significantly improves predictive performance with an AUROC of 0.80 (p < 0.001). Further examination shows that both the conditional and multi-label approach alone provided marginal lift to the model performance compared to a one-stage one-label approach. We also demonstrated the generalizability and applicability of this method using simulated data with high correlation between feature vectors for different labels. CONCLUSION: Our findings underscore the effectiveness of the developed conditional multi-label model for early prediction of an autism diagnosis. The study introduces a versatile strategy applicable to prediction tasks involving limited target populations but sharing underlying features or etiology among related groups.

Duke Scholars

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

September 2024

Volume

157

Start / End Page

104711

Location

United States

Related Subject Headings

  • Medical Informatics
  • Male
  • Infant
  • Humans
  • Female
  • Electronic Health Records
  • Child, Preschool
  • Child
  • Biomedical Engineering
  • Autistic Disorder
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Huang, W. A., Engelhard, M., Coffman, M., Hill, E. D., Weng, Q., Scheer, A., … Goldstein, B. A. (2024). A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis. J Biomed Inform, 157, 104711. https://doi.org/10.1016/j.jbi.2024.104711
Huang, Wei A., Matthew Engelhard, Marika Coffman, Elliot D. Hill, Qin Weng, Abby Scheer, Gary Maslow, Ricardo Henao, Geraldine Dawson, and Benjamin A. Goldstein. “A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis.J Biomed Inform 157 (September 2024): 104711. https://doi.org/10.1016/j.jbi.2024.104711.
Huang WA, Engelhard M, Coffman M, Hill ED, Weng Q, Scheer A, et al. A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis. J Biomed Inform. 2024 Sep;157:104711.
Huang, Wei A., et al. “A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis.J Biomed Inform, vol. 157, Sept. 2024, p. 104711. Pubmed, doi:10.1016/j.jbi.2024.104711.
Huang WA, Engelhard M, Coffman M, Hill ED, Weng Q, Scheer A, Maslow G, Henao R, Dawson G, Goldstein BA. A conditional multi-label model to improve prediction of a rare outcome: An illustration predicting autism diagnosis. J Biomed Inform. 2024 Sep;157:104711.
Journal cover image

Published In

J Biomed Inform

DOI

EISSN

1532-0480

Publication Date

September 2024

Volume

157

Start / End Page

104711

Location

United States

Related Subject Headings

  • Medical Informatics
  • Male
  • Infant
  • Humans
  • Female
  • Electronic Health Records
  • Child, Preschool
  • Child
  • Biomedical Engineering
  • Autistic Disorder