Skip to main navigation Skip to search Skip to main content

Quantitative bounds for concentration-of-measure inequalities and empirical regression: The independent case

  • Université Paris-Saclay

Research output: Contribution to journalArticlepeer-review

Abstract

This paper is devoted to the study of the deviation of the (random) average L2−error associated to the least-squares regressor over a family of functions Fn (with controlled complexity) obtained from n independent, but not necessarily identically distributed, samples of explanatory and response variables, from the minimal (deterministic) average L2−error associated to this family of functions, and to some of the corresponding consequences for the problem of consistency. In the i.i.d. case, this specializes as classical questions on least-squares regression problems, but in more general cases, this setting permits a precise investigation in the direction of the study of nonasymptotic errors for least-squares regression schemes in nonstationary settings, which we motivate providing background and examples. More precisely, we prove first two nonasymptotic deviation inequalities that generalize and refine corresponding known results in the i.i.d. case. We then explore some consequences for nonasymptotic bounds of the error both in the weak and the strong senses. Finally, we exploit these estimates to shed new light into questions of consistency for least-squares regression schemes in the distribution-free, nonparametric setting. As an application to the classical theory, we provide in particular a result that generalizes the link between the problem of consistency and the Glivenko-Cantelli property, which applied to regression in the i.i.d. setting over non-decreasing families (Fn)n of functions permits to create a scheme which is strongly consistent in L2 under the sole (necessary) assumption of the existence of functions in ∪nFn which are arbitrarily close in L2 to the corresponding regressor.

Original languageEnglish
Pages (from-to)45-81
Number of pages37
JournalJournal of Complexity
Volume52
DOIs
Publication statusPublished - 1 Jun 2019
Externally publishedYes

Keywords

  • Concentration inequalities
  • Consistency
  • Distribution-free estimates
  • Empirical processes
  • Empirical regression
  • Uniform deviation probability

Fingerprint

Dive into the research topics of 'Quantitative bounds for concentration-of-measure inequalities and empirical regression: The independent case'. Together they form a unique fingerprint.

Cite this