Passer à la navigation principale Passer à la recherche Passer au contenu principal

SNEkhorn: Dimension Reduction with Symmetric Entropic Affinities

  • CNRS UMR 5669, 'Unité de Mathématiques Pures et Appliquées' and project-team Inria NUMED, Ecole Normale Supérieure de Lyon
  • Ecole Normale Supérieure de Lyon
  • IRDL

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Many approaches in machine learning rely on a weighted graph to encode the similarities between samples in a dataset. Entropic affinities (EAs), which are notably used in the popular Dimensionality Reduction (DR) algorithm t-SNE, are particular instances of such graphs. To ensure robustness to heterogeneous sampling densities, EAs assign a kernel bandwidth parameter to every sample in such a way that the entropy of each row in the affinity matrix is kept constant at a specific value, whose exponential is known as perplexity. EAs are inherently asymmetric and row-wise stochastic, but they are used in DR approaches after undergoing heuristic symmetrization methods that violate both the row-wise constant entropy and stochasticity properties. In this work, we uncover a novel characterization of EA as an optimal transport problem, allowing a natural symmetrization that can be computed efficiently using dual ascent. The corresponding novel affinity matrix derives advantages from symmetric doubly stochastic normalization in terms of clustering performance, while also effectively controlling the entropy of each row thus making it particularly robust to varying noise levels. Following, we present a new DR algorithm, SNEkhorn, that leverages this new affinity matrix. We show its clear superiority to existing approaches with several indicators on both synthetic and real-world datasets.

langue originaleAnglais
titreAdvances in Neural Information Processing Systems 36 - 37th Conference on Neural Information Processing Systems, NeurIPS 2023
rédacteurs en chefA. Oh, T. Neumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
EditeurNeural information processing systems foundation
ISBN (Electronique)9781713899921
étatPublié - 1 janv. 2023
Evénement37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, États-Unis
Durée: 10 déc. 202316 déc. 2023

Série de publications

NomAdvances in Neural Information Processing Systems
Volume36
ISSN (imprimé)1049-5258

Une conférence

Une conférence37th Conference on Neural Information Processing Systems, NeurIPS 2023
Pays/TerritoireÉtats-Unis
La villeNew Orleans
période10/12/2316/12/23

Empreinte digitale

Examiner les sujets de recherche de « SNEkhorn: Dimension Reduction with Symmetric Entropic Affinities ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation