Missing-data methods for generalized linear models: A comparative review

Journal Article (Review;Journal)

Missing data is a major issue in many applied problems, especially in the biomedical sciences. We review four common approaches for inference in generalized linear models (GLMs) with missing covariate data: maximum likelihood (ML), multiple imputation (MI), fully Bayesian (FB), and weighted estimating equations (WEEs). There is considerable interest in how these four methodologies are related, the properties of each approach, the advantages and disadvantages of each methodology, and computational implementation. We examine data that are missing at random and nonignorable missing. For ML, we focus on techniques using the EM algorithm, and in particular, discuss the EM by the method of weights and related procedures as discussed by Ibrahim. For MI, we examine the techniques developed by Rubin. For FB, we review approaches considered by Ibrahim et al. For WEE, we focus on the techniques developed by Robins et al. We use a real dataset and a detailed simulation study to compare the four methods. © 2005 American Statistical Association.

Full Text

Duke Authors

Cited Authors

  • Ibrahim, JG; Chen, MH; Lipsitz, SR; Herring, AH

Published Date

  • March 1, 2005

Published In

Volume / Issue

  • 100 / 469

Start / End Page

  • 332 - 346

International Standard Serial Number (ISSN)

  • 0162-1459

Digital Object Identifier (DOI)

  • 10.1198/016214504000001844

Citation Source

  • Scopus