Passer à la navigation principale Passer à la recherche Passer au contenu principal

Scalability of stochastic gradient descent based on "smart" sampling techniques

  • Telecom Paris

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Various appealing ideas have been recently proposed in the statistical literature to scale-up machine learning techniques and solve predictive/inferential problems from "Big Datasets". Beyond the massively parallelized and distributed approaches exploiting hardware architectures and programming frameworks that have received increasing interest these last few years, several variants of the Stochastic Gradient Descent (SGD) method based on "smart" sampling procedures have been designed for accelerating the model fitting stage. Such techniques exploit either the form of the objective functional or some supposedly available auxiliary information, and have been thoroughly investigated from a theoretical viewpoint. Though attractive, such statistical methods must be also analyzed from a computational perspective, bearing the possible options offered by recent technological advances in mind. It is thus of vital importance to investigate how to implement efficiently these inferential principles in order to achieve the best trade-off between computational time and accuracy. In this paper, we explore the scalability of the SGD techniques introduced in [11, 9] from an experimental perspective. Issues related to their implementation on distributed computing platforms such as Apache Spark are also discussed and experimental results based on large-scale real datasets are displayed in order to illustrate the relevance of the promoted approaches.

langue originaleAnglais
Pages (de - à)308-315
Nombre de pages8
journalProcedia Computer Science
Volume53
Numéro de publication1
Les DOIs
étatPublié - 1 janv. 2015
EvénementINNS Conference on Big Data 2015 - San Francisco, États-Unis
Durée: 8 août 201510 août 2015

Empreinte digitale

Examiner les sujets de recherche de « Scalability of stochastic gradient descent based on "smart" sampling techniques ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation