Résumé
Leveraging the kernel trick in both the input and output spaces, surrogate kernel methods are a flexible and theoretically grounded solution to structured output prediction. If they provide state-of-the-art performance on complex data sets of moderate size (e.g., in chemoinformatics), these approaches however fail to scale. We propose to equip surrogate kernel methods with sketching-based approximations, applied to both the input and output feature maps. We prove excess risk bounds on the original structured prediction problem, showing how to attain close-to-optimal rates with a reduced sketch size that depends on the eigendecay of the input/output covariance operators. From a computational perspective, we show that the two approximations have distinct but complementary impacts: sketching the input kernel mostly reduces training time, while sketching the output kernel decreases the inference time. Empirically, our approach is shown to scale, achieving state-of-the-art performance on benchmark data sets where non-sketched methods are intractable.
| langue originale | Anglais |
|---|---|
| Pages (de - à) | 109-117 |
| Nombre de pages | 9 |
| journal | Proceedings of Machine Learning Research |
| Volume | 238 |
| état | Publié - 1 janv. 2024 |
| Evénement | 27th International Conference on Artificial Intelligence and Statistics, AISTATS 2024 - Valencia, Espagne Durée: 2 mai 2024 → 4 mai 2024 |
Empreinte digitale
Examiner les sujets de recherche de « Sketch In, Sketch Out: Accelerating both Learning and Inference for Structured Prediction with Kernels ». Ensemble, ils forment une empreinte digitale unique.Contient cette citation
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver