Passer à la navigation principale Passer à la recherche Passer au contenu principal

Open-Loop Control: The Stochastic Gradient Method

  • École des ponts

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionChapitreRevue par des pairs

Résumé

The stochastic gradient method has a rather long history. The method foundations were given by (Robbins and Monro, Annals of Mathematical Statistics, 22:400-407, (1951) [129]) on the one hand, and by (Kiefer and Wolfowitz, Annals of Mathematical Statistics, 23:462-466, (1952) [93]) on the other. Later on, (Polyak, Automation and Remote Control, 37(12):1858-1868, (1976) [120], Polyak and Tsypkin, Automation and Remote Control, 40(3):378-389, (1979) [123]) gave results about the convergence rate. Based on this work, (Dodu et al., Bulletin de la Direction des Études et Recherches EDF, (1981) [57]) studied the optimality of the stochastic gradient algorithm, that is, the asymptotic efficiency of the associated estimator. An important contribution by (Polyak, Automation and Remote Control, 51(7):937-946, (1990) [121], Polyak and Juditsky, SIAM Journal on Control and Optimization, 30(4):838-855, (1992) [122]) has been to combine stochastic gradient method and averaging techniques in order to reach the optimal efficiency.

langue originaleAnglais
titreProbability Theory and Stochastic Modelling
EditeurSpringer Nature
Pages27-62
Nombre de pages36
Les DOIs
étatPublié - 1 janv. 2015

Série de publications

NomProbability Theory and Stochastic Modelling
Volume75
ISSN (imprimé)2199-3130
ISSN (Electronique)2199-3149

Empreinte digitale

Examiner les sujets de recherche de « Open-Loop Control: The Stochastic Gradient Method ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation