Passer à la navigation principale Passer à la recherche Passer au contenu principal

Sample Complexity of Sinkhorn Divergences

  • Aude Genevay
  • , Lénaic Chizat
  • , Francis Bach
  • , Marco Cuturi
  • , Gabriel Peyré
  • PSL research University & IPSL
  • INRIA Institut National de Recherche en Informatique et en Automatique
  • ENSAE
  • CNRS

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Optimal transport (OT) distances and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. Our focus in this paper is on Sinkhorn divergences (SDs), a regularized variant of OT distances that can interpolate, depending on the regularization strength ε, between OT (ε = 0) and MMD (ε = ∞). Although the tradeoff induced by that regularization is by now well understood computationally (OT, SDs and MMD require respectively O(n3log n), O(n2) and n2operations to compare two samples of size n), much less is known in terms of the sample complexity of SDs, namely bounding the gap between the evaluation of SDs on two densities vs. samples from these densities. That complexity for OT and MMD stand at two extremes: O(1/n1/d) for OT in dimension d and O(1/√n) for MMD. that for SDs has only been studied empirically. In this paper, we (i) derive a bound on the approximation error made with SDs when approximating OT as a function of the regularizer ε, (ii) prove that the optimizers of regularized OT are bounded in a Sobolev (RKHS) ball independent of the two measures and (iii) reformulate SDs as a maximization problem in a RKHS to obtain a sample complexity in 1/√n (as in MMD), with a constant that depends however on ε, making the bridge between OT and MMD complete.

langue originaleAnglais
Pages (de - à)1574-1583
Nombre de pages10
journalProceedings of Machine Learning Research
Volume89
étatPublié - 1 janv. 2019
Modification externeOui
Evénement22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japon
Durée: 16 avr. 201918 avr. 2019

Empreinte digitale

Examiner les sujets de recherche de « Sample Complexity of Sinkhorn Divergences ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation