Abstract
Optimal transport (OT) distances and maximum mean discrepancies (MMD) are now routinely used in machine learning to compare probability measures. Our focus in this paper is on Sinkhorn divergences (SDs), a regularized variant of OT distances that can interpolate, depending on the regularization strength ", between OT (" = 0) and MMD (" = 1). Although the tradeoff induced by that regularization is by now well understood computationally (OT, SDs and MMD require respectively O(n3 log n), O(n2) and n2 operations to compare two samples of size n), much less is known in terms of the sample complexity of SDs, namely bounding the gap between the evaluation of SDs on two densities vs. samples from these densities. That complexity for OT and MMD stand at two extremes: O(1/n1/d) for OT in dimension d and O(1/pn) for MMD. that for SDs has only been studied empirically. In this paper, we (i) derive a bound on the approximation error made with SDs when approximating OT as a function of the regularizer ", (ii) prove that the optimizers of regularized OT are bounded in a Sobolev (RKHS) ball independent of the two measures and (iii) reformulate SDs as a maximization problem in a RKHS to obtain a sample complexity in 1/pn (as in MMD), with a constant that depends however on ", making the bridge between OT and MMD complete.
| Original language | English |
|---|---|
| Publication status | Published - 1 Jan 2019 |
| Externally published | Yes |
| Event | 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 - Naha, Japan Duration: 16 Apr 2019 → 18 Apr 2019 |
Conference
| Conference | 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 |
|---|---|
| Country/Territory | Japan |
| City | Naha |
| Period | 16/04/19 → 18/04/19 |
Fingerprint
Dive into the research topics of 'Sample complexity of sinkhorn divergences'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver