Skip to main content

Using Noise to Infer Aspects of Simplicity Without Learning

Publication ,  Conference
Boner, Z; Chen, H; Semenova, L; Parr, R; Rudin, C
Published in: Advances in Neural Information Processing Systems
January 1, 2024

Noise in data significantly influences decision-making in the data science process. In fact, it has been shown that noise in data generation processes leads practitioners to find simpler models. However, an open question still remains: what is the degree of model simplification we can expect under different noise levels? In this work, we address this question by investigating the relationship between the amount of noise and model simplicity across various hypothesis spaces, focusing on decision trees and linear models. We formally show that noise acts as an implicit regularizer for several different noise models. Furthermore, we prove that Rashomon sets (sets of near-optimal models) constructed with noisy data tend to contain simpler models than corresponding Rashomon sets with non-noisy data. Additionally, we show that noise expands the set of “good” features and consequently enlarges the set of models that use at least one good feature. Our work offers theoretical guarantees and practical insights for practitioners and policymakers on whether simple-yet-accurate machine learning models are likely to exist, based on knowledge of noise levels in the data generation process.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Boner, Z., Chen, H., Semenova, L., Parr, R., & Rudin, C. (2024). Using Noise to Infer Aspects of Simplicity Without Learning. In Advances in Neural Information Processing Systems (Vol. 37).
Boner, Z., H. Chen, L. Semenova, R. Parr, and C. Rudin. “Using Noise to Infer Aspects of Simplicity Without Learning.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.
Boner Z, Chen H, Semenova L, Parr R, Rudin C. Using Noise to Infer Aspects of Simplicity Without Learning. In: Advances in Neural Information Processing Systems. 2024.
Boner, Z., et al. “Using Noise to Infer Aspects of Simplicity Without Learning.” Advances in Neural Information Processing Systems, vol. 37, 2024.
Boner Z, Chen H, Semenova L, Parr R, Rudin C. Using Noise to Infer Aspects of Simplicity Without Learning. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology