An improved column generation algorithm for minimum sum-of-squares clustering

Research output: Contribution to journalArticlepeer-review

Abstract

Given a set of entities associated with points in Euclidean space, minimum sum-of-squares clustering (MSSC) consists in partitioning this set into clusters such that the sum of squared distances from each point to the centroid of its cluster is minimized. A column generation algorithm for MSSC was given by du Merle et al. in SIAM Journal Scientific Computing 21:1485-1505. The bottleneck of that algorithm is the resolution of the auxiliary problem of finding a column with negative reduced cost. We propose a new way to solve this auxiliary problem based on geometric arguments. This greatly improves the efficiency of the whole algorithm and leads to exact solution of instances with over 2,300 entities, i.e.; more than 10 times as much as previously done.

Original languageEnglish
Pages (from-to)195-220
Number of pages26
JournalMathematical Programming
Volume131
Issue number1-2
DOIs
Publication statusPublished - 1 Feb 2012

Keywords

  • ACCPM
  • Clustering
  • Column generation
  • Sum-of-squares

Fingerprint

Dive into the research topics of 'An improved column generation algorithm for minimum sum-of-squares clustering'. Together they form a unique fingerprint.

Cite this