Skip to main navigation Skip to search Skip to main content

Sample complexity of sinkhorn divergences

  • Aude Genevay
  • , Lénaic Chizat
  • , Francis Bach
  • , Marco Cuturi
  • , Gabriel Peyré
  • PSL research University & IPSL
  • INRIA Institut National de Recherche en Informatique et en Automatique
  • Google Inc.

Research output: Contribution to conferencePaperpeer-review

Abstract

Optimal transport (OT) distances and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. Our focus in this paper is on Sinkhorn divergences (SDs), a regularized variant of OT distances that can interpolate, depending on the regularization strength ", between OT (" = 0) and MMD (" = 1). Although the tradeoff induced by that regularization is by now well understood computationally (OT, SDs and MMD require respectively O(n3 log n), O(n2) and n2 operations to compare two samples of size n), much less is known in terms of the sample complexity of SDs, namely bounding the gap between the evaluation of SDs on two densities vs. samples from these densities. That complexity for OT and MMD stand at two extremes: O(1/n1/d) for OT in dimension d and O(1/pn) for MMD. that for SDs has only been studied empirically. In this paper, we (i) derive a bound on the approximation error made with SDs when approximating OT as a function of the regularizer ", (ii) prove that the optimizers of regularized OT are bounded in a Sobolev (RKHS) ball independent of the two measures and (iii) reformulate SDs as a maximization problem in a RKHS to obtain a sample complexity in 1/pn (as in MMD), with a constant that depends however on ", making the bridge between OT and MMD complete.

Original languageEnglish
Publication statusPublished - 1 Jan 2019
Externally publishedYes
Event22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japan
Duration: 16 Apr 201918 Apr 2019

Conference

Conference22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019
Country/TerritoryJapan
CityNaha
Period16/04/1918/04/19

Fingerprint

Dive into the research topics of 'Sample complexity of sinkhorn divergences'. Together they form a unique fingerprint.

Cite this