Skip to main content

Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys

Publication ,  Journal Article
Si, Y; Reiter, JP
Published in: Journal of Educational and Behavioral Statistics
October 2013

In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian, joint modeling approach to multiple imputation for categorical data based on Dirichlet process mixtures of multinomial distributions. The approach automatically models complex dependencies while being computationally expedient. The Dirichlet process prior distributions enable analysts to avoid fixing the number of mixture components at an arbitrary number. We illustrate repeated sampling properties of the approach using simulated data. We apply the methodology to impute missing background data in the 2007 Trends in International Mathematics and Science Study.

Duke Scholars

Published In

Journal of Educational and Behavioral Statistics

Publication Date

October 2013

Volume

38

Issue

5

Start / End Page

499 / 521

Related Subject Headings

  • Social Sciences Methods
  • 1701 Psychology
  • 1303 Specialist Studies in Education
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Si, Y., & Reiter, J. P. (2013). Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys. Journal of Educational and Behavioral Statistics, 38(5), 499–521.
Si, Yajuan, and Jerome P. Reiter. “Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys.” Journal of Educational and Behavioral Statistics 38, no. 5 (October 2013): 499–521.
Si Y, Reiter JP. Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys. Journal of Educational and Behavioral Statistics. 2013 Oct;38(5):499–521.
Si, Yajuan, and Jerome P. Reiter. “Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys.” Journal of Educational and Behavioral Statistics, vol. 38, no. 5, Oct. 2013, pp. 499–521.
Si Y, Reiter JP. Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys. Journal of Educational and Behavioral Statistics. 2013 Oct;38(5):499–521.

Published In

Journal of Educational and Behavioral Statistics

Publication Date

October 2013

Volume

38

Issue

5

Start / End Page

499 / 521

Related Subject Headings

  • Social Sciences Methods
  • 1701 Psychology
  • 1303 Specialist Studies in Education
  • 0104 Statistics