Skip to main navigation Skip to search Skip to main content

Scalability of stochastic gradient descent based on "smart" sampling techniques

  • Telecom Paris

Research output: Contribution to journalConference articlepeer-review

Abstract

Various appealing ideas have been recently proposed in the statistical literature to scale-up machine learning techniques and solve predictive/inferential problems from "Big Datasets". Beyond the massively parallelized and distributed approaches exploiting hardware architectures and programming frameworks that have received increasing interest these last few years, several variants of the Stochastic Gradient Descent (SGD) method based on "smart" sampling procedures have been designed for accelerating the model fitting stage. Such techniques exploit either the form of the objective functional or some supposedly available auxiliary information, and have been thoroughly investigated from a theoretical viewpoint. Though attractive, such statistical methods must be also analyzed from a computational perspective, bearing the possible options offered by recent technological advances in mind. It is thus of vital importance to investigate how to implement efficiently these inferential principles in order to achieve the best trade-off between computational time and accuracy. In this paper, we explore the scalability of the SGD techniques introduced in [11, 9] from an experimental perspective. Issues related to their implementation on distributed computing platforms such as Apache Spark are also discussed and experimental results based on large-scale real datasets are displayed in order to illustrate the relevance of the promoted approaches.

Original languageEnglish
Pages (from-to)308-315
Number of pages8
JournalProcedia Computer Science
Volume53
Issue number1
DOIs
Publication statusPublished - 1 Jan 2015
EventINNS Conference on Big Data 2015 - San Francisco, United States
Duration: 8 Aug 201510 Aug 2015

Keywords

  • Large-scale machine-learning
  • Sampling schemes
  • Stochastic gradient descent

Fingerprint

Dive into the research topics of 'Scalability of stochastic gradient descent based on "smart" sampling techniques'. Together they form a unique fingerprint.

Cite this