Skip to main content

RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS

Publication ,  Journal Article
Zhao, A; Ding, P
Published in: Annals of Statistics
April 1, 2022

The split-plot design arose from agricultural science with experimental units, also known as the subplots, nested within groups known as the whole plots. It assigns different interventions at the whole-plot and subplot levels, respectively, providing a convenient way to accommodate hard-to-change factors. By design, subplots within the same whole plot receive the same level of the whole-plot intervention, and thereby induce a group structure on the final treatment assignments. A common strategy is to run an ordinary least squares (OLS) regression of the outcome on the treatment indicators coupled with the robust standard errors clustered at the whole-plot level. It does not give consistent estimators for the treatment effects of interest when the whole-plot sizes vary. Another common strategy is to fit a linear mixed-effects model of the outcome with normal random effects and errors. It is a purely model-based approach and can be sensitive to violations of the parametric assumptions. In contrast, design-based inference assumes no outcome models and relies solely on the controllable randomization mechanism determined by the physical experiment. We first extend the existing design-based inference based on the Horvitz-Thompson estimator to the Hajek estimator, and establish the finite-population central limit theorem for both under split-plot randomization. We then reconcile the results with those under the model-based approach, and propose two regression strategies, namely (i) the weighted least squares (WLS) fit of the unit-level data based on the inverse probability weighting and (ii) the OLS fit of the aggregate data based on whole-plot total outcomes, to reproduce the Hajek and Horvitz- Thompson estimators, respectively. This, together with the asymptotic conservativeness of the corresponding cluster-robust covariances for estimating the true design-based covariances as we establish in the process, justifies the validity of the regression estimators for design-based inference. In light of the flexibility of regression formulation for covariate adjustment, we further extend the theory to the case with covariates, and demonstrate the efficiency gain by regression-based covariate adjustment via both asymptotic theory and simulation. Importantly, all our theories are either numeric or design-based, and hold regardless of how well the regression equations represent the true data generating process.

Duke Scholars

Published In

Annals of Statistics

DOI

EISSN

2168-8966

ISSN

0090-5364

Publication Date

April 1, 2022

Volume

50

Issue

2

Start / End Page

1170 / 1192

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1403 Econometrics
  • 0104 Statistics
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhao, A., & Ding, P. (2022). RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS. Annals of Statistics, 50(2), 1170–1192. https://doi.org/10.1214/21-AOS2144
Zhao, A., and P. Ding. “RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS.” Annals of Statistics 50, no. 2 (April 1, 2022): 1170–92. https://doi.org/10.1214/21-AOS2144.
Zhao A, Ding P. RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS. Annals of Statistics. 2022 Apr 1;50(2):1170–92.
Zhao, A., and P. Ding. “RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS.” Annals of Statistics, vol. 50, no. 2, Apr. 2022, pp. 1170–92. Scopus, doi:10.1214/21-AOS2144.
Zhao A, Ding P. RECONCILING DESIGN-BASED AND MODEL-BASED CAUSAL INFERENCES FOR SPLIT-PLOT EXPERIMENTS. Annals of Statistics. 2022 Apr 1;50(2):1170–1192.

Published In

Annals of Statistics

DOI

EISSN

2168-8966

ISSN

0090-5364

Publication Date

April 1, 2022

Volume

50

Issue

2

Start / End Page

1170 / 1192

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1403 Econometrics
  • 0104 Statistics
  • 0102 Applied Mathematics