Intra-cluster correlations from the CLustered OUtcome Dataset bank to inform the design of longitudinal cluster trials.

Journal Article (Journal Article)

BACKGROUND: Sample size calculations for longitudinal cluster randomised trials, such as crossover and stepped-wedge trials, require estimates of the assumed correlation structure. This includes both within-period intra-cluster correlations, which importantly differ from conventional intra-cluster correlations by their dependence on period, and also cluster autocorrelation coefficients to model correlation decay. There are limited resources to inform these estimates. In this article, we provide a repository of correlation estimates from a bank of real-world clustered datasets. These are provided under several assumed correlation structures, namely exchangeable, block-exchangeable and discrete-time decay correlation structures. METHODS: Longitudinal studies with clustered outcomes were collected to form the CLustered OUtcome Dataset bank. Forty-four available continuous outcomes from 29 datasets were obtained and analysed using each correlation structure. Patterns of within-period intra-cluster correlation coefficient and cluster autocorrelation coefficients were explored by study characteristics. RESULTS: The median within-period intra-cluster correlation coefficient for the discrete-time decay model was 0.05 (interquartile range: 0.02-0.09) with a median cluster autocorrelation of 0.73 (interquartile range: 0.19-0.91). The within-period intra-cluster correlation coefficients were similar for the exchangeable, block-exchangeable and discrete-time decay correlation structures. Within-period intra-cluster correlation coefficients and cluster autocorrelations were found to vary with the number of participants per cluster-period, the period-length, type of cluster (primary care, secondary care, community or school) and country income status (high-income country or low- and middle-income country). The within-period intra-cluster correlation coefficients tended to decrease with increasing period-length and slightly decrease with increasing cluster-period sizes, while the cluster autocorrelations tended to move closer to 1 with increasing cluster-period size. Using the CLustered OUtcome Dataset bank, an RShiny app has been developed for determining plausible values of correlation coefficients for use in sample size calculations. DISCUSSION: This study provides a repository of intra-cluster correlations and cluster autocorrelations for longitudinal cluster trials. This can help inform sample size calculations for future longitudinal cluster randomised trials.

Full Text

Duke Authors

Cited Authors

  • Korevaar, E; Kasza, J; Taljaard, M; Hemming, K; Haines, T; Turner, EL; Thompson, JA; Hughes, JP; Forbes, AB

Published Date

  • June 4, 2021

Published In

Start / End Page

  • 17407745211020852 -

PubMed ID

  • 34088230

Electronic International Standard Serial Number (EISSN)

  • 1740-7753

Digital Object Identifier (DOI)

  • 10.1177/17407745211020852


  • eng

Conference Location

  • England