Abstract
We consider binary classification problems with positive definite kernels and square loss, and study the convergence rates of stochastic gradient methods. We show that while the excess testing loss (squared loss) converges slowly to zero as the number of observations (and thus iterations) goes to infinity, the testing error (classification error) converges exponentially fast if low-noise conditions are assumed. To achieve these rates of convergence we show sharper high-probability bounds with respect to the number of observations for stochastic gradient descent.
| Original language | English |
|---|---|
| Pages (from-to) | 250-296 |
| Number of pages | 47 |
| Journal | Proceedings of Machine Learning Research |
| Volume | 75 |
| Publication status | Published - 1 Jan 2018 |
| Externally published | Yes |
| Event | 31st Annual Conference on Learning Theory, COLT 2018 - Stockholm, Sweden Duration: 6 Jul 2018 → 9 Jul 2018 |
Keywords
- SGD
- binary classification
- margin condition
- positive-definite kernels
Fingerprint
Dive into the research topics of 'Exponential Convergence of Testing Error for Stochastic Gradient Methods'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver