
Multiple imputation for multivariate missing-data problems: A data analyst's perspective
Analyses of multivariate data are frequently hampered by missing values. Until recently, the only missing-data methods available to most data analysts have been relatively ad hoc practices such as listwise deletion. Recent dramatic advances in theoretical and computational statistics, however, have produced a new generation of flexible procedures with a sound statistical basis. These procedures involve multiple imputation (Rubin, 1987), a simulation technique that replaces each missing datum with a set of m > 1 plausible values. The m versions of the complete data are analyzed by standard complete-data methods, and the results are combined using simple rules to yield estimates, standard errors, and p-values that formally incorporate missing-data uncertainty. New computational algorithms and software described in a recent book (Schafer, 1997a) allow us to create proper multiple imputations in complex multivariate settings This article reviews the key ideas of multiple imputation, discusses the software programs currently available, and demonstrates their use on data from the Adolescent Alcohol Prevention Trial (Hansen & Graham, 1991).
Duke Scholars
Altmetric Attention Stats
Dimensions Citation Stats
Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Social Sciences Methods
- 52 Psychology
- 49 Mathematical sciences
- 35 Commerce, management, tourism and services
- 17 Psychology and Cognitive Sciences
- 15 Commerce, Management, Tourism and Services
- 01 Mathematical Sciences
Citation

Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Related Subject Headings
- Social Sciences Methods
- 52 Psychology
- 49 Mathematical sciences
- 35 Commerce, management, tourism and services
- 17 Psychology and Cognitive Sciences
- 15 Commerce, Management, Tourism and Services
- 01 Mathematical Sciences