Skip to main content

How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data

Publication ,  Conference
Maitra, I; Lin, R; Chen, E; Donnelly, J; Šćepanović, S; Rudin, C
Published in: Proceedings of the Aaai Conference on Artificial Intelligence
April 11, 2025

Health outcomes depend on complex environmental and sociodemographic factors whose effects change over location and time. Only recently has fine-grained spatial and temporal data become available to study these effects, namely the MEDSAT dataset of English health, environmental, and sociodemographic information. Leveraging this new resource, we use a variety of variable importance techniques to robustly identify the most informative predictors across multiple health outcomes. We then develop an interpretable machine learning framework based on Generalized Additive Models (GAMs) and Multiscale Geographically Weighted Regression (MGWR) to analyze both local and global spatial dependencies of each variable on various health outcomes. Our findings identify NO2 as a global predictor for asthma, hypertension, and anxiety, alongside other outcome-specific predictors related to occupation, marriage, and vegetation. Regional analyses reveal local variations with air pollution and solar radiation, with notable shifts during COVID. This comprehensive approach provides actionable insights for addressing health disparities, and advocates for the integration of interpretable machine learning in public health.

Duke Scholars

Published In

Proceedings of the Aaai Conference on Artificial Intelligence

DOI

EISSN

2374-3468

ISSN

2159-5399

Publication Date

April 11, 2025

Volume

39

Issue

27

Start / End Page

28240 / 28248
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Maitra, I., Lin, R., Chen, E., Donnelly, J., Šćepanović, S., & Rudin, C. (2025). How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data. In Proceedings of the Aaai Conference on Artificial Intelligence (Vol. 39, pp. 28240–28248). https://doi.org/10.1609/aaai.v39i27.35044
Maitra, I., R. Lin, E. Chen, J. Donnelly, S. Šćepanović, and C. Rudin. “How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data.” In Proceedings of the Aaai Conference on Artificial Intelligence, 39:28240–48, 2025. https://doi.org/10.1609/aaai.v39i27.35044.
Maitra I, Lin R, Chen E, Donnelly J, Šćepanović S, Rudin C. How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data. In: Proceedings of the Aaai Conference on Artificial Intelligence. 2025. p. 28240–8.
Maitra, I., et al. “How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data.” Proceedings of the Aaai Conference on Artificial Intelligence, vol. 39, no. 27, 2025, pp. 28240–48. Scopus, doi:10.1609/aaai.v39i27.35044.
Maitra I, Lin R, Chen E, Donnelly J, Šćepanović S, Rudin C. How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data. Proceedings of the Aaai Conference on Artificial Intelligence. 2025. p. 28240–28248.

Published In

Proceedings of the Aaai Conference on Artificial Intelligence

DOI

EISSN

2374-3468

ISSN

2159-5399

Publication Date

April 11, 2025

Volume

39

Issue

27

Start / End Page

28240 / 28248