Constrained Hierarchical Clustering via Graph Coarsening and Optimal Cuts

Eliabelle Mauduit, Andrea Simonetto

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Motivated by extracting and summarizing relevant information in short sentence settings, such as satisfaction questionnaires, hotel reviews, and X/Twitter, we study the problem of clustering words in a hierarchical fashion. In particular, we focus on the problem of clustering with horizontal and vertical structural constraints. Horizontal constraints are typically cannot-link and must-link among words, while vertical constraints are precedence constraints among cluster levels. We overcome state-of-the-art bottlenecks by formulating the problem in two steps: first, as a soft-constrained regularized least-squares which guides the result of a sequential graph coarsening algorithm towards the horizontal feasible set. Then, flat clusters are extracted from the resulting hierarchical tree by computing optimal cut heights based on the available constraints. We show that the resulting approach compares very well with respect to existing algorithms and is computationally light.

Original languageEnglish
Title of host publicationConference Record of the 57th Asilomar Conference on Signals, Systems and Computers, ACSSC 2023
EditorsMichael B. Matthews
PublisherIEEE Computer Society
Pages844-848
Number of pages5
ISBN (Electronic)9798350325744
DOIs
Publication statusPublished - 1 Jan 2023
Event57th Asilomar Conference on Signals, Systems and Computers, ACSSC 2023 - Pacific Grove, United States
Duration: 29 Oct 20231 Nov 2023

Publication series

NameConference Record - Asilomar Conference on Signals, Systems and Computers
ISSN (Print)1058-6393

Conference

Conference57th Asilomar Conference on Signals, Systems and Computers, ACSSC 2023
Country/TerritoryUnited States
CityPacific Grove
Period29/10/231/11/23

Keywords

  • Constrained Hierarchical Clustering
  • Graph Coarsening
  • Optimization
  • Ultrametric Spaces

Fingerprint

Dive into the research topics of 'Constrained Hierarchical Clustering via Graph Coarsening and Optimal Cuts'. Together they form a unique fingerprint.

Cite this