TY - JOUR
T1 - INFERENCE FOR CHANGE POINTS IN HIGH-DIMENSIONAL DATA VIA SELFNORMALIZATION
AU - Wang, Runmin
AU - Zhu, Changbo
AU - Volgushev, Stanislav
AU - Shao, Xiaofeng
N1 - Funding. Shao’s research is partially supported by NSF Grants DMS-1807023 and DMS-2014018. Vogulshev’s research is partially supported by a discovery grant from NSERC of Canada.
PY - 2022/4
Y1 - 2022/4
N2 - This article considers change-point testing and estimation for a sequence of high-dimensional data. In the case of testing for a mean shift for highdimensional independent data, we propose a new test which is based on Ustatistic in Chen and Qin (Ann. Statist. 38 (2010) 808-835) and utilizes the self-normalization principle (Shao J. R. Stat. Soc. Ser. B. Stat. Methodol. 72 (2010) 343-366; Shao and Zhang J. Amer. Statist. Assoc. 105 (2010) 1228- 1240). Our test targets dense alternatives in the high-dimensional setting and involves no tuning parameters. To extend to change-point testing for highdimensional time series, we introduce a trimming parameter and formulate a self-normalized test statistic with trimming to accommodate the weak temporal dependence. On the theory front we derive the limiting distributions of self-normalized test statistics under both the null and alternatives for both independent and dependent high-dimensional data. At the core of our asymptotic theory, we obtain weak convergence of a sequential U-statistic based process for high-dimensional independent data, and weak convergence of sequential trimmed U-statistic based processes for high-dimensional linear processes, both of which are of independent interests. Additionally, we illustrate how our tests can be used in combination with wild binary segmentation to estimate the number and location of multiple change points. Numerical simulations demonstrate the competitiveness of our proposed testing and estimation procedures in comparison with several existing methods in the literature.
AB - This article considers change-point testing and estimation for a sequence of high-dimensional data. In the case of testing for a mean shift for highdimensional independent data, we propose a new test which is based on Ustatistic in Chen and Qin (Ann. Statist. 38 (2010) 808-835) and utilizes the self-normalization principle (Shao J. R. Stat. Soc. Ser. B. Stat. Methodol. 72 (2010) 343-366; Shao and Zhang J. Amer. Statist. Assoc. 105 (2010) 1228- 1240). Our test targets dense alternatives in the high-dimensional setting and involves no tuning parameters. To extend to change-point testing for highdimensional time series, we introduce a trimming parameter and formulate a self-normalized test statistic with trimming to accommodate the weak temporal dependence. On the theory front we derive the limiting distributions of self-normalized test statistics under both the null and alternatives for both independent and dependent high-dimensional data. At the core of our asymptotic theory, we obtain weak convergence of a sequential U-statistic based process for high-dimensional independent data, and weak convergence of sequential trimmed U-statistic based processes for high-dimensional linear processes, both of which are of independent interests. Additionally, we illustrate how our tests can be used in combination with wild binary segmentation to estimate the number and location of multiple change points. Numerical simulations demonstrate the competitiveness of our proposed testing and estimation procedures in comparison with several existing methods in the literature.
UR - http://www.scopus.com/inward/record.url?scp=85130794981&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130794981&partnerID=8YFLogxK
U2 - 10.1214/21-AOS2127
DO - 10.1214/21-AOS2127
M3 - Article
AN - SCOPUS:85130794981
SN - 0090-5364
VL - 50
SP - 781
EP - 806
JO - Annals of Statistics
JF - Annals of Statistics
IS - 2
ER -