Two parts are better than one: modeling marginal means of semicontinuous data

Publication, Journal Article
Smith, VA; Neelon, B; Maciejewski, ML; Preisser, JS
Published in: Health Services and Outcomes Research Methodology
December 1, 2017

In health services research, it is common to encounter semicontinuous data, characterized by a point mass at zero followed by a continuous distribution with positive support. These data are often analyzed using two-part mixtures that separately model the probability of use to account for the portion of the sample with zero values. Commonly, but not always, the second component models the continuous values conditional on their being positive. Prior work examining whether such two-part models are needed to draw appropriate inference from semicontinuous data, as compared with standard one-part regression models, has found mixed results. However, prior studies have generally used only measures of model fit on a single dataset, leaving the question unresolved. This paper uses simulations to evaluate in detail the appropriateness of standard one-part generalized linear models (GLMs) compared to a recently developed marginalized two-part (MTP) model. The MTP model, unlike the one-part GLMs, explicitly accounts for the point mass at zero, yet takes the same form for the marginal mean as the commonly used GLM with log link, making the covariate effects directly comparable. We simulate data scenarios with varying sample sizes and percentages of zeros. One-part GLMs resulted in increased bias, lower-than-nominal coverage of confidence intervals, and inflated type I error rates, rendering them inappropriate for use with semicontinuous data. Even when distributional assumptions were violated, estimates of covariate effects and type I error rates under the MTP model remained robust.
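
To make the marginal-mean idea concrete, the following is a minimal sketch of how semicontinuous data might be simulated under an MTP-style parameterization. It is not the authors' simulation code. It assumes a log-normal distribution for the positive values, a single binary covariate, and illustrative parameter values; the function names `simulate_mtp` and `group_mean` are hypothetical. Under these assumptions, the probability of a positive value follows a logistic model, and the log-scale location of the positive part is offset so that the overall mean, zeros included, equals exp(beta0 + beta1 * x).

```python
import math
import random


def simulate_mtp(n, alpha, beta, sigma, seed=0):
    """Draw (x, y) pairs from an assumed log-normal marginalized two-part model.

    alpha: (intercept, slope) for logit P(Y > 0)   -- illustrative values only
    beta:  (intercept, slope) for log E[Y]         -- the marginal mean
    sigma: log-scale standard deviation of the positive part
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = 1 if rng.random() < 0.5 else 0  # hypothetical binary covariate
        # Part 1: probability of a positive value (logit link).
        pi = 1.0 / (1.0 + math.exp(-(alpha[0] + alpha[1] * x)))
        # Marginal mean on the log scale.
        log_nu = beta[0] + beta[1] * x
        # Log-scale location of the positive part, offset so that
        # pi * E[Y | Y > 0] = exp(log_nu).
        mu = log_nu - math.log(pi) - sigma ** 2 / 2.0
        y = rng.lognormvariate(mu, sigma) if rng.random() < pi else 0.0
        data.append((x, y))
    return data


def group_mean(data, x_value):
    """Sample mean of y, zeros included, among observations with x == x_value."""
    ys = [y for x, y in data if x == x_value]
    return sum(ys) / len(ys)


if __name__ == "__main__":
    sample = simulate_mtp(200_000, alpha=(0.0, 0.5), beta=(1.0, 0.3), sigma=0.5)
    m0, m1 = group_mean(sample, 0), group_mean(sample, 1)
    print("log ratio of overall means:", math.log(m1 / m0))
    print("share of zeros:", sum(1 for _, y in sample if y == 0.0) / len(sample))
```

In this sketch the log ratio of the two group means should land near the assumed `beta[1]`, which is the sense in which the covariate effect describes the whole population rather than only those with positive values.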

Published In

Health Services and Outcomes Research Methodology

DOI

10.1007/s10742-017-0169-9

EISSN

1572-9400

ISSN

1387-3741

Publication Date

December 1, 2017

Volume

17

Issue

3-4

Start / End Page

198 / 218

Related Subject Headings

  • Health Policy & Services
  • 35 Commerce, management, tourism and services
  • 15 Commerce, Management, Tourism and Services
 

Citation

APA
Smith, V. A., Neelon, B., Maciejewski, M. L., & Preisser, J. S. (2017). Two parts are better than one: modeling marginal means of semicontinuous data. Health Services and Outcomes Research Methodology, 17(3–4), 198–218. https://doi.org/10.1007/s10742-017-0169-9

Chicago
Smith, V. A., B. Neelon, M. L. Maciejewski, and J. S. Preisser. “Two parts are better than one: modeling marginal means of semicontinuous data.” Health Services and Outcomes Research Methodology 17, no. 3–4 (December 1, 2017): 198–218. https://doi.org/10.1007/s10742-017-0169-9.

ICMJE
Smith VA, Neelon B, Maciejewski ML, Preisser JS. Two parts are better than one: modeling marginal means of semicontinuous data. Health Services and Outcomes Research Methodology. 2017 Dec 1;17(3–4):198–218.

MLA
Smith, V. A., et al. “Two parts are better than one: modeling marginal means of semicontinuous data.” Health Services and Outcomes Research Methodology, vol. 17, no. 3–4, Dec. 2017, pp. 198–218. Scopus, doi:10.1007/s10742-017-0169-9.

NLM
Smith VA, Neelon B, Maciejewski ML, Preisser JS. Two parts are better than one: modeling marginal means of semicontinuous data. Health Services and Outcomes Research Methodology. 2017 Dec 1;17(3–4):198–218.