Gradient descent algorithms for Bures-Wasserstein barycenters

Research output: Contribution to journalConference articlepeer-review

Abstract

We study first order methods to compute the barycenter of a probability distribution P over the space of probability measures with finite second moment. We develop a framework to derive global rates of convergence for both gradient descent and stochastic gradient descent despite the fact that the barycenter functional is not geodesically convex. Our analysis overcomes this technical hurdle by employing a Polyak-Łojasiewicz (PL) inequality and relies on tools from optimal transport and metric geometry. In turn, we establish a PL inequality when P is supported on the Bures-Wasserstein manifold of Gaussian probability measures. It leads to the first global rates of convergence for first order methods in this context.

Original languageEnglish
Pages (from-to)1276-1304
Number of pages29
JournalProceedings of Machine Learning Research
Volume125
Publication statusPublished - 1 Jan 2020
Externally publishedYes
Event33rd Conference on Learning Theory, COLT 2020 - Virtual, Online, Austria
Duration: 9 Jul 202012 Jul 2020

Keywords

  • Wasserstein barycenters
  • geodesic optimization
  • optimal transport

Fingerprint

Dive into the research topics of 'Gradient descent algorithms for Bures-Wasserstein barycenters'. Together they form a unique fingerprint.

Cite this