Adaptive Testing for High-Dimensional Data

  • Yangfan Zhang
  • , Runmin Wang
  • , Xiaofeng Shao

Research output: Contribution to journalArticlepeer-review

Abstract

In this article, we propose a class of (Formula presented.) -norm based U-statistics for a family of global testing problems related to high-dimensional data. This includes testing of mean vector and its spatial sign, simultaneous testing of linear model coefficients, and testing of component-wise independence for high-dimensional observations, among others. Under the null hypothesis, we derive asymptotic normality and independence between (Formula presented.) -norm based U-statistics for several qs under mild moment and cumulant conditions. A simple combination of two studentized (Formula presented.) -based test statistics via their p-values is proposed and is shown to attain great power against alternatives of different sparsity. Our work is a substantial extension of He et al., which is mostly focused on mean and covariance testing, and we manage to provide a general treatment of asymptotic independence of (Formula presented.) -norm based U-statistics for a wide class of kernels. To alleviate the computation burden, we introduce a variant of the proposed U-statistics by using the monotone indices in the summation, resulting in a U-statistic with asymmetric kernel. A dynamic programming method is introduced to reduce the computational cost from (Formula presented.), which is required for the calculation of the full U-statistic, to (Formula presented.) where r is the order of the kernel. Numerical results further corroborate the advantage of the proposed adaptive test as compared to some existing competitors. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Original languageEnglish (US)
Pages (from-to)1893-1905
Number of pages13
JournalJournal of the American Statistical Association
Volume120
Issue number551
Early online dateFeb 18 2025
DOIs
StatePublished - 2025

Keywords

  • Independence testing
  • Simultaneous testing
  • Spatial sign
  • U-statistics

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint

Dive into the research topics of 'Adaptive Testing for High-Dimensional Data'. Together they form a unique fingerprint.

Cite this