Dating the break in high-dimensional data

Runmin Wang, Xiaofeng Shao

Research output: Contribution to journalArticlepeer-review

Abstract

This paper is concerned with estimation and inference for the location of a change point in the mean of independent high-dimensional data. Our change point location estimator maximizes a new U-statistic based objective function, and its convergence rate and asymptotic distribution after suitable centering and normalization are obtained under mild assumptions. Our estimator turns out to have better efficiency as compared to the least squares based counterpart in the literature. Based on the asymptotic theory, we construct a confidence interval by plugging in consistent estimates of several quantities in the normalization. We also provide a bootstrap-based confidence interval and state its asymptotic validity under suitable conditions. Through simulation studies, we demonstrate favorable finite sample performance of the new change point location estimator as compared to its least squares based counterpart, and our bootstrap-based confidence intervals, as compared to several existing competitors. The asymptotic theory based on high-dimensional U-statistic is substantially different from those developed in the literature and is of independent interest. The usefulness of our bootstrap-based confidence interval is illustrated in a genomics data set.

Original languageEnglish (US)
Pages (from-to)2879-2901
Number of pages23
JournalBernoulli
Volume29
Issue number4
DOIs
StatePublished - Nov 2023

Keywords

  • Change point detection
  • U-statistic
  • high dimension
  • structural break

ASJC Scopus subject areas

  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Dating the break in high-dimensional data'. Together they form a unique fingerprint.

Cite this