Abstract
Estimating the individualized treatment effect has become one of the most popular topics in statistics and machine learning communities in recent years. Most existing methods focus on modeling the heterogeneous treatment effects for univariate outcomes. However, many biomedical studies are interested in studying multiple highly correlated endpoints at the same time. We propose a random forest model that simultaneously estimates individualized treatment effects of multivariate outcomes. We consider a popular study design where covariates and outcomes are measured both before and after the intervention. The proposed model uses oblique splitting rules to partition population space to the neighborhood that experiences distinct treatment effects. An extensive simulation study suggests that the proposed method outperforms existing methods in various nonlinear settings. We further apply the proposed method to two nutrition studies investigating the effects of food consumption on gastrointestinal microbiota composition and clinical biomarkers. The method has been implemented in a freely available R package MOTE.RF at https://github.com/boyiguo1/MOTE.RF.
Original language | English (US) |
---|---|
Pages (from-to) | 545-561 |
Number of pages | 17 |
Journal | Statistics in Biosciences |
Volume | 15 |
Issue number | 3 |
DOIs | |
State | Published - Dec 2023 |
Keywords
- Individualized treatment effect
- Microbiota
- Multivariate
- Personalized nutrition
- Random forests
ASJC Scopus subject areas
- Statistics and Probability
- Biochemistry, Genetics and Molecular Biology (miscellaneous)