Résumé
We consider a reformulation of Reduced-Rank Regression (RRR) and Sparse Reduced-Rank Regression (SRRR) as a non-convex non-differentiable function of a single of the two matrices usually introduced to parametrize low-rank matrix learning problems. We study the behavior of proximal gradient algorithms for the minimization of the objective. In par-ticular, based on an analysis of the geometry of the problem, we establish that a proximal Polyak-Lojasiewicz inequality is satisfied in a neighborhood of the set of optima under a condition on the regularization parameter. We consequently derive linear convergence rates for the proximal gradient descent with line search and for related algorithms in a neighborhood of the optima. Our experiments show that our formulation leads to much faster learning algorithms for RRR and especially for SRRR.
| langue originale | Anglais |
|---|---|
| Pages (de - à) | 2415-2424 |
| Nombre de pages | 10 |
| journal | Proceedings of Machine Learning Research |
| Volume | 89 |
| état | Publié - 1 janv. 2019 |
| Evénement | 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japon Durée: 16 avr. 2019 → 18 avr. 2019 |
Empreinte digitale
Examiner les sujets de recherche de « Fast Algorithms for Sparse Reduced-Rank Regression ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver