Passer à la navigation principale Passer à la recherche Passer au contenu principal

Label noise (stochastic) gradient descent implicitly solves the Lasso for quadratic parametrisation

  • EPFL
  • École des Ponts

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Understanding the implicit bias of training algorithms is of crucial importance in order to explain the success of overparametrised neural networks. In this paper, we study the role of the label noise in the training dynamics of a quadratically parametrised model through its continuous time version. We explicitly characterise the solution chosen by the stochastic flow and prove that it implicitly solves a Lasso program. To fully complete our analysis, we provide nonasymptotic convergence guarantees for the dynamics as well as conditions for support recovery. We also give experimental results which support our theoretical claims. Our findings highlight the fact that structured noise can induce better generalisation and help explain the greater performances of stochastic dynamics as observed in practice.

langue originaleAnglais
Pages (de - à)2127-2159
Nombre de pages33
journalProceedings of Machine Learning Research
Volume178
étatPublié - 1 janv. 2022
Modification externeOui
Evénement35th Conference on Learning Theory, COLT 2022 - Hybrid, London, Royaume-Uni
Durée: 2 juil. 20225 juil. 2022

Empreinte digitale

Examiner les sujets de recherche de « Label noise (stochastic) gradient descent implicitly solves the Lasso for quadratic parametrisation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation