Skip to main navigation Skip to search Skip to main content

On wasserstein two-sample testing and related families of nonparametric tests

  • University of California, Berkeley
  • Women and Infants Hospital of Rhode Island-Warren Alpert Medical School of Brown University
  • ENSAE

Research output: Contribution to journalArticlepeer-review

Abstract

Nonparametric two-sample or homogeneity testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. The literature is old and rich, with a wide variety of statistics having being designed and analyzed, both for the unidimensional and the multivariate setting. In this short survey, we focus on test statistics that involve theWasserstein distance. Using an entropic smoothing of the Wasserstein distance, we connect these to very different tests including multivariate methods involving energy statistics and kernel based maximum mean discrepancy and univariate methods like the Kolmogorov-Smirnov test, probability or quantile (PP/QQ) plots and receiver operating characteristic or ordinal dominance (ROC/ODC) curves. Some observations are implicit in the literature, while others seem to have not been noticed thus far. Given nonparametric two-sample testing's classical and continued importance, we aim to provide useful connections for theorists and practitioners familiar with one subset of methods but not others.

Original languageEnglish
Article number47
JournalEntropy
Volume19
Issue number2
DOIs
Publication statusPublished - 1 Jan 2017
Externally publishedYes

Keywords

  • Energy distance
  • Entropic smoothing
  • Maximum mean discrepancy
  • QQ and PP plots
  • ROC and ODC curves
  • Two-sample testing
  • Wasserstein distance

Fingerprint

Dive into the research topics of 'On wasserstein two-sample testing and related families of nonparametric tests'. Together they form a unique fingerprint.

Cite this