Skip to main navigation Skip to search Skip to main content

Persistence-Based clustering in riemannian manifolds

  • INRIA
  • Stanford University

Research output: Contribution to journalArticlepeer-review

Abstract

We present a clustering scheme that combines a mode-seeking phase with a cluster merging phase in the corresponding density map. While mode detection is done by a standard graph-based hill-climbing scheme, the novelty of our approach resides in its use of topological persistence to guide the merging of clusters. Our algorithm provides additional feedback in the form of a set of points in the plane, called a persistence diagram (PD), which provably reflects the prominences of the modes of the density. In practice, this feedback enables the user to choose relevant parameter values, so that under mild sampling conditions the algorithm will output the correct number of clusters, a notion that can be made formally sound within persistence theory. In addition, the output clusters have the property that their spatial locations are bound to the ones of the basins of attraction of the peaks of the density. The algorithm only requires rough estimates of the density at the data points, and knowledge of (approximate) pairwise distances between them. It is therefore applicable in any metric space. Meanwhile, its complexity remains practical: although the size of the input distance matrix may be up to quadratic in the number of data points, a careful implementation only uses a linear amount of memory and takes barely more time to run than to read through the input. Categories and Subject Descriptors: F.2.2 [Analysis of Algorithms and Problem Complexity]: Nonnumerical Algorithms and Problems-Geometrical problems and computations, computations on discrete structures.

Original languageEnglish
Article number41
JournalJournal of the ACM
Volume60
Issue number6
DOIs
Publication statusPublished - 1 Nov 2013
Externally publishedYes

Keywords

  • Clustering
  • Computational topology
  • Mode seeking
  • Morse theory
  • Topological persistence
  • Unsupervised learning

Fingerprint

Dive into the research topics of 'Persistence-Based clustering in riemannian manifolds'. Together they form a unique fingerprint.

Cite this