Towards automated configuration of stream clustering algorithms

  • Matthias Carnein
  • , Heike Trautmann
  • , Albert Bifet
  • , Bernhard Pfahringer

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Clustering is an important technique in data analysis which can reveal hidden patterns and unknown relationships in the data. A common problem in clustering is the proper choice of parameter settings. To tackle this, automated algorithm configuration is available which can automatically find the best parameter settings. In practice, however, many of our today’s data sources are data streams due to the widespread deployment of sensors, the internet-of-things or (social) media. Stream clustering aims to tackle this challenge by identifying, tracking and updating clusters over time. Unfortunately, none of the existing approaches for automated algorithm configuration are directly applicable to the streaming scenario. In this paper, we explore the possibility of automated algorithm configuration for stream clustering algorithms using an ensemble of different configurations. In first experiments, we demonstrate that our approach is able to automatically find superior configurations and refine them over time.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - International Workshops of ECML PKDD 2019, Proceedings
EditorsPeggy Cellier, Kurt Driessens
PublisherSpringer
Pages137-143
Number of pages7
ISBN (Print)9783030438227
DOIs
Publication statusPublished - 1 Jan 2020
Externally publishedYes
Event19th Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2019 - Wurzburg, Germany
Duration: 16 Sept 201920 Sept 2019

Publication series

NameCommunications in Computer and Information Science
Volume1167 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference19th Joint European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2019
Country/TerritoryGermany
CityWurzburg
Period16/09/1920/09/19

Keywords

  • Algorithm selection
  • Automated algorithm configuration
  • Ensemble techniques
  • Stream clustering

Fingerprint

Dive into the research topics of 'Towards automated configuration of stream clustering algorithms'. Together they form a unique fingerprint.

Cite this