Skip to main content
Journal cover image

Supersparse linear integer models for optimized medical scoring systems

Publication ,  Journal Article
Ustun, B; Rudin, C
Published in: Machine Learning
March 1, 2016

Scoring systems are linear classification models that only require users to add, subtract and multiply a few small numbers in order to make a prediction. These models are in widespread use by the medical community, but are difficult to learn from data because they need to be accurate and sparse, have coprime integer coefficients, and satisfy multiple operational constraints. We present a new method for creating data-driven scoring systems called a Supersparse Linear Integer Model (SLIM). SLIM scoring systems are built by using an integer programming problem that directly encodes measures of accuracy (the 0–1 loss) and sparsity (the (Formula presented.) -seminorm) while restricting coefficients to coprime integers. SLIM can seamlessly incorporate a wide range of operational constraints related to accuracy and sparsity, and can produce acceptable models without parameter tuning because of the direct control provided over these quantities. We provide bounds on the testing and training accuracy of SLIM scoring systems, and present a new data reduction technique that can improve scalability by eliminating a portion of the training data beforehand. Our paper includes results from a collaboration with the Massachusetts General Hospital Sleep Laboratory, where SLIM is being used to create a highly tailored scoring system for sleep apnea screening.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Machine Learning

DOI

EISSN

1573-0565

ISSN

0885-6125

Publication Date

March 1, 2016

Volume

102

Issue

3

Start / End Page

349 / 391

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 0806 Information Systems
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ustun, B., & Rudin, C. (2016). Supersparse linear integer models for optimized medical scoring systems. Machine Learning, 102(3), 349–391. https://doi.org/10.1007/s10994-015-5528-6
Ustun, B., and C. Rudin. “Supersparse linear integer models for optimized medical scoring systems.” Machine Learning 102, no. 3 (March 1, 2016): 349–91. https://doi.org/10.1007/s10994-015-5528-6.
Ustun B, Rudin C. Supersparse linear integer models for optimized medical scoring systems. Machine Learning. 2016 Mar 1;102(3):349–91.
Ustun, B., and C. Rudin. “Supersparse linear integer models for optimized medical scoring systems.” Machine Learning, vol. 102, no. 3, Mar. 2016, pp. 349–91. Scopus, doi:10.1007/s10994-015-5528-6.
Ustun B, Rudin C. Supersparse linear integer models for optimized medical scoring systems. Machine Learning. 2016 Mar 1;102(3):349–391.
Journal cover image

Published In

Machine Learning

DOI

EISSN

1573-0565

ISSN

0885-6125

Publication Date

March 1, 2016

Volume

102

Issue

3

Start / End Page

349 / 391

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 0806 Information Systems
  • 0801 Artificial Intelligence and Image Processing