Passer à la navigation principale Passer à la recherche Passer au contenu principal

On convergence-diagnostic based step sizes for stochastic gradient descent

  • Theory of Machine Learning lab
  • Ecole Polytechnique

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Constant step-size Stochastic Gradient Descent exhibits two phases: a transient phase during which iterates make fast progress towards the optimum, followed by a stationary phase during which iterates oscillate around the optimal point. In this paper, we show that efficiently detecting this transition and appropriately decreasing the step size can lead to fast convergence rates. We analyse the classical statistical test proposed by Pflug (1983), based on the inner product between consecutive stochastic gradients. Even in the simple case where the objective function is quadratic we show that this test cannot lead to an adequate convergence diagnostic. We then propose a novel and simple statistical procedure that accurately detects stationarity and we provide experimental results showing state-of-the-art performance on synthetic and real-world datasets.

langue originaleAnglais
titre37th International Conference on Machine Learning, ICML 2020
rédacteurs en chefHal Daume, Aarti Singh
EditeurInternational Machine Learning Society (IMLS)
Pages7597-7607
Nombre de pages11
ISBN (Electronique)9781713821120
étatPublié - 1 janv. 2020
Modification externeOui
Evénement37th International Conference on Machine Learning, ICML 2020 - Virtual, Online
Durée: 13 juil. 202018 juil. 2020

Série de publications

Nom37th International Conference on Machine Learning, ICML 2020
VolumePartF168147-10

Une conférence

Une conférence37th International Conference on Machine Learning, ICML 2020
La villeVirtual, Online
période13/07/2018/07/20

Empreinte digitale

Examiner les sujets de recherche de « On convergence-diagnostic based step sizes for stochastic gradient descent ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation