Martingale Difference Correlation and Its Use in High-Dimensional Variable Screening

Xiaofeng Shao, Jingsi Zhang

Research output: Contribution to journalArticlepeer-review

Abstract

In this article, we propose a new metric, the so-called martingale difference correlation, to measure the departure of conditional mean independence between a scalar response variable V and a vector predictor variable U. Our metric is a natural extension of distance correlation proposed by Székely, Rizzo, and Bahirov, which is used to measure the dependence between V and U. The martingale difference correlation and its empirical counterpart inherit a number of desirable features of distance correlation and sample distance correlation, such as algebraic simplicity and elegant theoretical properties. We further use martingale difference correlation as a marginal utility to do high-dimensional variable screening to screen out variables that do not contribute to conditional mean of the response given the covariates. Further extension to conditional quantile screening is also described in detail and sure screening properties are rigorously justified. Both simulation results and real data illustrations demonstrate the effectiveness of martingale difference correlation-based screening procedures in comparison with the existing counterparts. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)1302-1318
Number of pages17
JournalJournal of the American Statistical Association
Volume109
Issue number507
DOIs
StatePublished - Sep 2014

Keywords

  • Conditional mean
  • Feature screening
  • High-dimensional inference
  • Sure screening property

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Fingerprint Dive into the research topics of 'Martingale Difference Correlation and Its Use in High-Dimensional Variable Screening'. Together they form a unique fingerprint.

Cite this