Skip to main content
Journal cover image

Multiple imputation for missing data via sequential regression trees.

Publication ,  Journal Article
Burgette, LF; Reiter, JP
Published in: American journal of epidemiology
November 2010

Multiple imputation is particularly well suited to deal with missing data in large epidemiologic studies, because typically these studies support a wide range of analyses by many data users. Some of these analyses may involve complex modeling, including interactions and nonlinear relations. Identifying such relations and encoding them in imputation models, for example, in the conditional regressions for multiple imputation via chained equations, can be daunting tasks with large numbers of categorical and continuous variables. The authors present a nonparametric approach for implementing multiple imputation via chained equations by using sequential regression trees as the conditional models. This has the potential to capture complex relations with minimal tuning by the data imputer. Using simulations, the authors demonstrate that the method can result in more plausible imputations, and hence more reliable inferences, in complex settings than the naive application of standard sequential regression imputation techniques. They apply the approach to impute missing values in data on adverse birth outcomes with more than 100 clinical and survey variables. They evaluate the imputations using posterior predictive checks with several epidemiologic analyses of interest.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

American journal of epidemiology

DOI

EISSN

1476-6256

ISSN

0002-9262

Publication Date

November 2010

Volume

172

Issue

9

Start / End Page

1070 / 1076

Related Subject Headings

  • Statistics, Nonparametric
  • Multivariate Analysis
  • Humans
  • Evidence-Based Medicine
  • Epidemiology
  • Epidemiologic Studies
  • Epidemiologic Methods
  • Data Interpretation, Statistical
  • Data Collection
  • Computer Simulation
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Burgette, L. F., & Reiter, J. P. (2010). Multiple imputation for missing data via sequential regression trees. American Journal of Epidemiology, 172(9), 1070–1076. https://doi.org/10.1093/aje/kwq260
Burgette, Lane F., and Jerome P. Reiter. “Multiple imputation for missing data via sequential regression trees.American Journal of Epidemiology 172, no. 9 (November 2010): 1070–76. https://doi.org/10.1093/aje/kwq260.
Burgette LF, Reiter JP. Multiple imputation for missing data via sequential regression trees. American journal of epidemiology. 2010 Nov;172(9):1070–6.
Burgette, Lane F., and Jerome P. Reiter. “Multiple imputation for missing data via sequential regression trees.American Journal of Epidemiology, vol. 172, no. 9, Nov. 2010, pp. 1070–76. Epmc, doi:10.1093/aje/kwq260.
Burgette LF, Reiter JP. Multiple imputation for missing data via sequential regression trees. American journal of epidemiology. 2010 Nov;172(9):1070–1076.
Journal cover image

Published In

American journal of epidemiology

DOI

EISSN

1476-6256

ISSN

0002-9262

Publication Date

November 2010

Volume

172

Issue

9

Start / End Page

1070 / 1076

Related Subject Headings

  • Statistics, Nonparametric
  • Multivariate Analysis
  • Humans
  • Evidence-Based Medicine
  • Epidemiology
  • Epidemiologic Studies
  • Epidemiologic Methods
  • Data Interpretation, Statistical
  • Data Collection
  • Computer Simulation