Abstract

Estimating the individualized treatment effect has become one of the most popular topics in statistics and machine learning communities in recent years. Most existing methods focus on modeling the heterogeneous treatment effects for univariate outcomes. However, many biomedical studies are interested in studying multiple highly correlated endpoints at the same time. We propose a random forest model that simultaneously estimates individualized treatment effects of multivariate outcomes. We consider a popular study design where covariates and outcomes are measured both before and after the intervention. The proposed model uses oblique splitting rules to partition population space to the neighborhood that experiences distinct treatment effects. An extensive simulation study suggests that the proposed method outperforms existing methods in various nonlinear settings. We further apply the proposed method to two nutrition studies investigating the effects of food consumption on gastrointestinal microbiota composition and clinical biomarkers. The method has been implemented in a freely available R package MOTE.RF at https://github.com/boyiguo1/MOTE.RF.

Original languageEnglish (US)
Pages (from-to)545-561
Number of pages17
JournalStatistics in Biosciences
Volume15
Issue number3
DOIs
StatePublished - Dec 2023

Keywords

  • Individualized treatment effect
  • Microbiota
  • Multivariate
  • Personalized nutrition
  • Random forests

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry, Genetics and Molecular Biology (miscellaneous)

Fingerprint

Dive into the research topics of 'Estimating Heterogeneous Treatment Effect on Multivariate Responses Using Random Forests'. Together they form a unique fingerprint.

Cite this