The One Standard Error Rule for Model Selection: Does It Work?

Yuchen Chen, Yuhong Yang

Research output: Contribution to journalArticlepeer-review

Abstract

Previous research provided a lot of discussion on the selection of regularization parameters when it comes to the application of regularization methods for high-dimensional regression. The popular “One Standard Error Rule” (1se rule) used with cross validation (CV) is to select the most parsimonious model whose prediction error is not much worse than the minimum CV error. This paper examines the validity of the 1se rule from a theoretical angle and also studies its estimation accuracy and performances in applications of regression estimation and variable selection, particularly for Lasso in a regression framework. Our theoretical result shows that when a regression procedure produces the regression estimator converging relatively fast to the true regression function, the standard error estimation formula in the 1se rule is justified asymptotically. The numerical results show the following: 1. the 1se rule in general does not necessarily provide a good estimation for the intended standard deviation of the cross validation error. The estimation bias can be 50–100% upwards or downwards in various situations; 2. the results tend to support that 1se rule usually outperforms the regular CV in sparse variable selection and alleviates the over-selection tendency of Lasso; 3. in regression estimation or prediction, the 1se rule often performs worse. In addition, comparisons are made over two real data sets: Boston Housing Prices (large sample size n, small/moderate number of variables p) and Bardet–Biedl data (large p, small n). Data guided simulations are done to provide insight on the relative performances of the 1se rule and the regular CV.
Original languageEnglish (US)
Pages (from-to)868-892
Number of pages25
JournalStats
Volume4
Issue number4
DOIs
StatePublished - Dec 2021
Externally publishedYes

Keywords

  • subsampling
  • variable selection
  • regression estimation
  • estimation accuracy
  • tuning parameter selection

Fingerprint

Dive into the research topics of 'The One Standard Error Rule for Model Selection: Does It Work?'. Together they form a unique fingerprint.

Cite this