Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.

Publication, Journal Article
Harrell, FE; Lee, KL; Mark, DB
Published in: Stat Med
February 28, 1996

Multivariable regression models are powerful tools that are used frequently in studies of clinical outcomes. These models can use a mixture of categorical and continuous variables and can handle partially observed (censored) responses. However, uncritical application of modelling techniques can result in models that poorly fit the dataset at hand, or, even more likely, inaccurately predict outcomes on new subjects. One must know how to measure qualities of a model's fit in order to avoid poorly fitted or overfitted models. Measurement of predictive accuracy can be difficult for survival time data in the presence of censoring. We discuss an easily interpretable index of predictive discrimination as well as methods for assessing calibration of predicted survival probabilities. Both types of predictive accuracy should be unbiasedly validated using bootstrapping or cross-validation, before using predictions in a new data series. We discuss some of the hazards of poorly fitted and overfitted regression models and present one modelling strategy that avoids many of the problems discussed. The methods described are applicable to all regression models, but are particularly needed for binary, ordinal, and time-to-event outcomes. Methods are illustrated with a survival analysis in prostate cancer using Cox regression.
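As an illustration of what an index of predictive discrimination for censored survival data can look like, the following is a minimal sketch of a concordance (c) index. The abstract does not name the index or give a formula, so the pairwise definition used here, the function name `concordance_index`, the simplified handling of tied times, and the toy data are all assumptions made for illustration, not the authors' implementation.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Concordance index for right-censored data (illustrative sketch).

    times       : observed follow-up time for each subject
    events      : 1 if the event was observed, 0 if the time is censored
    risk_scores : model output, where a higher score means higher predicted
                  risk (that is, shorter predicted survival)
    """
    concordant = 0.0
    usable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that subject a has the shorter observed time.
        a, b = (i, j) if times[i] <= times[j] else (j, i)
        if times[a] == times[b]:
            continue  # assumption: pairs with tied times are skipped
        if not events[a]:
            continue  # shorter time is censored, so the true order is unknown
        usable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0  # higher risk went with the earlier event
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if usable == 0:
        raise ValueError("no usable pairs: cannot compute concordance")
    return concordant / usable


# Hypothetical toy data: four subjects, one of them censored.
times = [5, 3, 9, 7]
events = [1, 1, 0, 1]
risk = [0.6, 0.9, 0.1, 0.4]
c = concordance_index(times, events, risk)
```

A value of 0.5 corresponds to predictions no better than chance and 1.0 to perfect ordering of the usable pairs. Computed on the data used to fit the model, such an index tends to be optimistic, which is why the abstract calls for validation by bootstrapping or cross-validation.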

Published In

Stat Med

DOI

10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4

ISSN

0277-6715

Publication Date

February 28, 1996

Volume

15

Issue

4

Start / End Page

361 / 387

Location

England

Related Subject Headings

  • Treatment Outcome
  • Survival Analysis
  • Statistics & Probability
  • Software
  • Regression Analysis
  • Prostatic Neoplasms
  • Multivariate Analysis
  • Models, Statistical
  • Mathematical Computing
  • Male
 

Citation

APA
Harrell, F. E., Lee, K. L., & Mark, D. B. (1996). Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med, 15(4), 361–387. https://doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4
Chicago
Harrell, F. E., K. L. Lee, and D. B. Mark. “Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.” Stat Med 15, no. 4 (February 28, 1996): 361–87. https://doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4.
MLA
Harrell, F. E., et al. “Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.” Stat Med, vol. 15, no. 4, Feb. 1996, pp. 361–87. PubMed, doi:10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4.