Skip to main navigation Skip to search Skip to main content

Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization

  • Université Paris-Saclay

Research output: Contribution to journalConference articlepeer-review

Abstract

Recent studies on diffusion-based sampling methods have shown that Langevin Monte Carlo (LMC) algorithms can be beneficial for nonconvex optimization, and rigorous theoretical guarantees have been proven for both asymptotic and finite-time regimes. Algorithmically, LMC-based algorithms resemble the well-known gradient descent (GD) algorithm, where the GD recursion is perturbed by an additive Gaussian noise whose variance has a particular form. Fractional Langevin Monte Carlo (FLMC) is a recently proposed extension of LMC, where the Gaussian noise is replaced by a heavy-tailed αstable noise. As opposed to its Gaussian counterpart, these heavy-tailed perturbations can incur large jumps and it has been empirically demonstrated that the choice of α-stable noise can provide several advantages in modern machine learning problems, both in optimization and sampling contexts. However, as opposed to LMC, only asymptotic convergence properties of FLMC have been yet established. In this study, we analyze the non-asymptotic behavior of FLMC for nonconvex optimization and prove finite-time bounds for its expected suboptimality. Our results show that the weak-error of FLMC increases faster than LMC, which suggests using smaller step-sizes in FLMC. We finally extend our results to the case where the exact gradients are replaced by stochastic gradients and show that similar results hold in this setting as well.

Original languageEnglish
Pages (from-to)4810-4819
Number of pages10
JournalProceedings of Machine Learning Research
Volume97
Publication statusPublished - 1 Jan 2019
Externally publishedYes
Event36th International Conference on Machine Learning, ICML 2019 - Long Beach, United States
Duration: 9 Jun 201915 Jun 2019

Fingerprint

Dive into the research topics of 'Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization'. Together they form a unique fingerprint.

Cite this