TY - JOUR
T1 - Model-Robust and Efficient Covariate Adjustment for Cluster-Randomized Experiments
AU - Wang, Bingkai
AU - Park, Chan
AU - Small, Dylan S.
AU - Li, Fan
N1 - Research in this article was supported by Patient-Centered Outcomes Research Institute Awards\u00AE (PCORI\u00AE Awards ME-2020C3-21072, ME-2022C2-27676) and National Institute of Allergy and Infectious Diseases (NIAID) grants R01AI148127, K99AI173395. The statements presented are solely the responsibility of the authors and do not necessarily represent the official views of PCORI\u00AE, its Board of Governors or Methodology Committee, or NIAID.
PY - 2024
Y1 - 2024
N2 - Cluster-randomized experiments are increasingly used to evaluate interventions in routine practice conditions, and researchers often adopt model-based methods with covariate adjustment in the statistical analyses. However, the validity of model-based covariate adjustment remains unclear when the working models are misspecified, leading to ambiguity of estimands and risk of bias. In this article, we first adapt two model-based methods—generalized estimating equations and linear mixed models—with weighted g-computation to achieve robust inference for cluster-average and individual-average treatment effects. To further overcome the limitations of model-based covariate adjustment methods, we propose efficient estimators for each estimand that allow for flexible covariate adjustment and additionally address cluster size variation dependent on treatment assignment and other cluster characteristics. Such cluster size variations often occur post-randomization and, if ignored, can lead to bias of model-based estimators. For our proposed covariate-adjusted estimators, we prove that when the nuisance functions are consistently estimated by machine learning algorithms, the estimators are consistent, asymptotically normal, and efficient. When the nuisance functions are estimated via parametric working models, the estimators are triply-robust. Simulation studies and analyses of three real-world cluster-randomized experiments demonstrate that the proposed methods are superior to existing alternatives. Supplementary materials for this article are available online.
AB - Cluster-randomized experiments are increasingly used to evaluate interventions in routine practice conditions, and researchers often adopt model-based methods with covariate adjustment in the statistical analyses. However, the validity of model-based covariate adjustment remains unclear when the working models are misspecified, leading to ambiguity of estimands and risk of bias. In this article, we first adapt two model-based methods—generalized estimating equations and linear mixed models—with weighted g-computation to achieve robust inference for cluster-average and individual-average treatment effects. To further overcome the limitations of model-based covariate adjustment methods, we propose efficient estimators for each estimand that allow for flexible covariate adjustment and additionally address cluster size variation dependent on treatment assignment and other cluster characteristics. Such cluster size variations often occur post-randomization and, if ignored, can lead to bias of model-based estimators. For our proposed covariate-adjusted estimators, we prove that when the nuisance functions are consistently estimated by machine learning algorithms, the estimators are consistent, asymptotically normal, and efficient. When the nuisance functions are estimated via parametric working models, the estimators are triply-robust. Simulation studies and analyses of three real-world cluster-randomized experiments demonstrate that the proposed methods are superior to existing alternatives. Supplementary materials for this article are available online.
KW - Causal inference
KW - Cluster-randomized trial
KW - Covariate adjustment
KW - Efficient influence function
KW - Estimands
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85181938410&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85181938410&partnerID=8YFLogxK
U2 - 10.1080/01621459.2023.2289693
DO - 10.1080/01621459.2023.2289693
M3 - Article
C2 - 39911293
AN - SCOPUS:85181938410
SN - 0162-1459
VL - 119
SP - 2959
EP - 2971
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 548
ER -