Passer à la navigation principale Passer à la recherche Passer au contenu principal

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

  • Umut Şimşekli
  • , Çağatay Yıldız
  • , Thanh Huy Nguyen
  • , Gaël Richard
  • , A. Taylan Cemgil
  • Université Paris-Saclay
  • Aalto University
  • Bogazici University

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Recent studies have illustrated that stochastic gradient Markov Chain Monte Carlo techniques have a strong potential in non-convex optimization, where local and global convergence guarantees can be shown under certain conditions. By building up on this recent theory, in this study, we develop an asynchronous-parallel stochastic L-BFGS algorithm for non-convex optimization. The proposed algorithm is suitable for both distributed and shared-memory settings. We provide formal theoretical analysis and show that the proposed method achieves an ergodic convergence rate of O(1/ N) (N being the total number of iterations) and it can achieve a linear speedup under certain conditions. We perform several experiments on both synthetic and real datasets. The results support our theory and show that the proposed algorithm provides a significant speedup over the recently proposed synchronous distributed L-BFGS algorithm.

langue originaleAnglais
Pages (de - à)4674-4683
Nombre de pages10
journalProceedings of Machine Learning Research
Volume80
étatPublié - 1 janv. 2018
Modification externeOui
Evénement35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sucde
Durée: 10 juil. 201815 juil. 2018

Empreinte digitale

Examiner les sujets de recherche de « Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation