TY - JOUR
T1 - Inference via Robust Optimal Transportation
T2 - Theory and Methods
AU - Ma, Yiming
AU - Liu, Hang
AU - La Vecchia, Davide
AU - Lerasle, Matthieu
N1 - Publisher Copyright:
© 2025 International Statistical Institute.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - Optimal transportation (OT) is widely applied in statistics and machine learning. Despite its popularity, inference based on OT has some issues. For instance, it is sensitive to outliers and may not be even defined when the underlying model has infinite moments. To cope with these problems, first, we consider a robust version of the primal transportation problem and show that it defines the robust Wasserstein distance, (Formula presented.), depending on a tuning parameter (Formula presented.). Second, we illustrate the link between 1-Wasserstein distance (Formula presented.) and (Formula presented.) and study its key measure theoretic aspects. Third, we derive some concentration inequalities for (Formula presented.). Fourth, we use (Formula presented.) to define minimum distance estimators, provide their statistical guarantees and illustrate how to apply the concentration inequalities for a selection of (Formula presented.). Fifth, we provide the dual form of the robust optimal transportation (ROBOT) and apply it to machine learning problems. We review, in a unified perspective, the key aspects of OT and ROBOT, while complementing the existing results with our new methodological findings. Numerical exercises provide evidence of the benefits of our novel methods.
AB - Optimal transportation (OT) is widely applied in statistics and machine learning. Despite its popularity, inference based on OT has some issues. For instance, it is sensitive to outliers and may not be even defined when the underlying model has infinite moments. To cope with these problems, first, we consider a robust version of the primal transportation problem and show that it defines the robust Wasserstein distance, (Formula presented.), depending on a tuning parameter (Formula presented.). Second, we illustrate the link between 1-Wasserstein distance (Formula presented.) and (Formula presented.) and study its key measure theoretic aspects. Third, we derive some concentration inequalities for (Formula presented.). Fourth, we use (Formula presented.) to define minimum distance estimators, provide their statistical guarantees and illustrate how to apply the concentration inequalities for a selection of (Formula presented.). Fifth, we provide the dual form of the robust optimal transportation (ROBOT) and apply it to machine learning problems. We review, in a unified perspective, the key aspects of OT and ROBOT, while complementing the existing results with our new methodological findings. Numerical exercises provide evidence of the benefits of our novel methods.
KW - concentration inequalities
KW - minimum distance estimation
KW - robust Wasserstein distance
KW - robust generative adversarial networks
UR - https://www.scopus.com/pages/publications/105009326618
U2 - 10.1111/insr.70000
DO - 10.1111/insr.70000
M3 - Article
AN - SCOPUS:105009326618
SN - 0306-7734
JO - International Statistical Review
JF - International Statistical Review
ER -