Two-sample and change-point inference for non-Euclidean valued time series

Research output: Contribution to journalArticlepeer-review

Abstract

Data objects taking value in a general metric space have become increasingly common in modern data analysis. In this paper, we study two important statistical inference problems, namely, two-sample testing and change-point detection, for such non-Euclidean data under temporal dependence. Typical examples of non-Euclidean valued time series include yearly mortality distributions, time-varying networks, and covariance matrix time series. To accommodate unknown temporal dependence, we ad-vance the self-normalization (SN) technique [22] to the inference of non-Euclidean time series, which is substantially different from the existing SN-based inference for functional time series that reside in Hilbert space [33]. Theoretically, we propose new regularity conditions that could be easier to check than those in the recent literature, and derive the limiting distributions of the proposed test statistics under both null and local alternatives. For change-point detection problem, we also derive the consistency for the change-point location estimator, and combine our proposed change-point test with wild binary segmentation to perform multiple change-point esti-mation. Numerical simulations demonstrate the effectiveness and robust-ness of our proposed tests compared with existing methods in the literature. Finally, we apply our tests to two-sample inference in mortality data and change-point detection in cryptocurrency data.

Original languageEnglish (US)
Pages (from-to)848-894
Number of pages47
JournalElectronic Journal of Statistics
Volume18
Issue number1
DOIs
StatePublished - 2024

Keywords

  • Change-point detection
  • Fréchet mean
  • functional data
  • random objects
  • temporal dependence

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Two-sample and change-point inference for non-Euclidean valued time series'. Together they form a unique fingerprint.

Cite this