Tight conditions for consistent variable selection in high dimensional nonparametric regression

Research output: Contribution to journalConference articlepeer-review

Abstract

We address the issue of variable selection in the regression model with very high ambient dimension, i.e., when the number of covariates is very large. The main focus is on the situation where the number of relevant covariates, called intrinsic dimension, is much smaller than the ambient dimension. Without assuming any parametric form of the underlying regression function, we get tight conditions making it possible to consistently estimate the set of relevant variables. These conditions relate the intrinsic dimension to the ambient dimension and to the sample size. The procedure that is provably consistent under these tight conditions is simple and is based on comparing the empirical Fourier coefficients with an appropriately chosen threshold value.

Original languageEnglish
Pages (from-to)187-205
Number of pages19
JournalJournal of Machine Learning Research
Volume19
Publication statusPublished - 1 Jan 2011
Externally publishedYes
Event24th International Conference on Learning Theory, COLT 2011 co located with Foundation of Computational Mathematics Conference, FOCM 2011 - Budapest, Hungary
Duration: 9 Jul 201111 Jul 2011

Keywords

  • List of keywords

Fingerprint

Dive into the research topics of 'Tight conditions for consistent variable selection in high dimensional nonparametric regression'. Together they form a unique fingerprint.

Cite this