Skip to main content

Interpretable Generalized Additive Models for Datasets with Missing Values

Publication ,  Conference
McTavish, H; Donnelly, J; Seltzer, M; Rudin, C
Published in: Advances in Neural Information Processing Systems
January 1, 2024

Many important datasets contain samples that are missing one or more feature values. Maintaining the interpretability of machine learning models in the presence of such missing data is challenging. Singly or multiply imputing missing values complicates the model's mapping from features to labels. On the other hand, reasoning on indicator variables that represent missingness introduces a potentially large number of additional terms, sacrificing sparsity. We solve these problems with M-GAM, a sparse, generalized, additive modeling approach that incorporates missingness indicators and their interaction terms while maintaining sparsity through ℓ0 regularization. We show that M-GAM provides similar or superior accuracy to prior methods while significantly improving sparsity relative to either imputation or naïve inclusion of indicator variables.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
McTavish, H., Donnelly, J., Seltzer, M., & Rudin, C. (2024). Interpretable Generalized Additive Models for Datasets with Missing Values. In Advances in Neural Information Processing Systems (Vol. 37).
McTavish, H., J. Donnelly, M. Seltzer, and C. Rudin. “Interpretable Generalized Additive Models for Datasets with Missing Values.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.
McTavish H, Donnelly J, Seltzer M, Rudin C. Interpretable Generalized Additive Models for Datasets with Missing Values. In: Advances in Neural Information Processing Systems. 2024.
McTavish, H., et al. “Interpretable Generalized Additive Models for Datasets with Missing Values.” Advances in Neural Information Processing Systems, vol. 37, 2024.
McTavish H, Donnelly J, Seltzer M, Rudin C. Interpretable Generalized Additive Models for Datasets with Missing Values. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology