Skip to main content

Improving Bayesian mixture models for multiple imputation of missing data using focused clustering

Publication ,  Journal Article
Wei, L; Reiter, JP
Published in: REVSTAT-Statistical Journal
April 1, 2018

We present a joint modeling approach for multiple imputation of missing continuous and categorical variables using Bayesian mixture models. The approach extends the idea of focused clustering, in which one separates variables into two sets before estimating the mixture model. Focus variables include variables with high rates of missingness and possibly other variables that could help improve the quality of the imputations. Non-focus variables include the remainder. In this way, one can use a rich sub-model for the focus set and a simpler model for the non-focus set, thereby concentrating fitting power on the variables with the highest rates of missingness. We present a procedure for specifying which variables with low rates of missingness to include in the focus set. We examine the performance of the imputation procedure using simulation studies based on artificial data and on data from the American Community Survey.

Duke Scholars

Published In

REVSTAT-Statistical Journal

EISSN

2183-0371

ISSN

1645-6726

Publication Date

April 1, 2018

Volume

16

Issue

2

Start / End Page

213 / 230

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wei, L., & Reiter, J. P. (2018). Improving Bayesian mixture models for multiple imputation of missing data using focused clustering. REVSTAT-Statistical Journal, 16(2), 213–230.
Wei, L., and J. P. Reiter. “Improving Bayesian mixture models for multiple imputation of missing data using focused clustering.” REVSTAT-Statistical Journal 16, no. 2 (April 1, 2018): 213–30.
Wei L, Reiter JP. Improving Bayesian mixture models for multiple imputation of missing data using focused clustering. REVSTAT-Statistical Journal. 2018 Apr 1;16(2):213–30.
Wei, L., and J. P. Reiter. “Improving Bayesian mixture models for multiple imputation of missing data using focused clustering.” REVSTAT-Statistical Journal, vol. 16, no. 2, Apr. 2018, pp. 213–30.
Wei L, Reiter JP. Improving Bayesian mixture models for multiple imputation of missing data using focused clustering. REVSTAT-Statistical Journal. 2018 Apr 1;16(2):213–230.

Published In

REVSTAT-Statistical Journal

EISSN

2183-0371

ISSN

1645-6726

Publication Date

April 1, 2018

Volume

16

Issue

2

Start / End Page

213 / 230

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics