Passer à la navigation principale Passer à la recherche Passer au contenu principal

Sample complexity of sinkhorn divergences

  • Aude Genevay
  • , Lénaic Chizat
  • , Francis Bach
  • , Marco Cuturi
  • , Gabriel Peyré
  • PSL research University & IPSL
  • INRIA Institut National de Recherche en Informatique et en Automatique
  • Google Inc.

Résultats de recherche: Contribution à une conférencePapierRevue par des pairs

Résumé

Optimal transport (OT) distances and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. Our focus in this paper is on Sinkhorn divergences (SDs), a regularized variant of OT distances that can interpolate, depending on the regularization strength ", between OT (" = 0) and MMD (" = 1). Although the tradeoff induced by that regularization is by now well understood computationally (OT, SDs and MMD require respectively O(n3 log n), O(n2) and n2 operations to compare two samples of size n), much less is known in terms of the sample complexity of SDs, namely bounding the gap between the evaluation of SDs on two densities vs. samples from these densities. That complexity for OT and MMD stand at two extremes: O(1/n1/d) for OT in dimension d and O(1/pn) for MMD. that for SDs has only been studied empirically. In this paper, we (i) derive a bound on the approximation error made with SDs when approximating OT as a function of the regularizer ", (ii) prove that the optimizers of regularized OT are bounded in a Sobolev (RKHS) ball independent of the two measures and (iii) reformulate SDs as a maximization problem in a RKHS to obtain a sample complexity in 1/pn (as in MMD), with a constant that depends however on ", making the bridge between OT and MMD complete.

langue originaleAnglais
étatPublié - 1 janv. 2019
Modification externeOui
Evénement22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japon
Durée: 16 avr. 201918 avr. 2019

Une conférence

Une conférence22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019
Pays/TerritoireJapon
La villeNaha
période16/04/1918/04/19

Empreinte digitale

Examiner les sujets de recherche de « Sample complexity of sinkhorn divergences ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation