confstream: automated algorithm selection and configuration of stream clustering algorithms

  • Matthias Carnein
  • , Heike Trautmann
  • , Albert Bifet
  • , Bernhard Pfahringer

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Machine learning has become one of the most important tools in data analysis. However, selecting the most appropriate machine learning algorithm and tuning its hyperparameters to their optimal values remains a difficult task. This is even more difficult for streaming applications where automated approaches are often not available to help during algorithm selection and configuration. This paper proposes the first approach for automated algorithm selection and configuration of stream clustering algorithms. We train an ensemble of different stream clustering algorithms and configurations in parallel and use the best performing configuration to obtain a clustering solution. By drawing new configurations from better performing ones, we are able to improve the ensemble performance over time. In large experiments on real and artificial data we show how our ensemble approach can improve upon default configurations and can also compete with a-posteriori algorithm configuration. Our approach is considerably faster than a-posteriori approaches and applicable in real-time. In addition, it is not limited to stream clustering and can be generalised to all streaming applications, including stream classification and regression.

Original languageEnglish
Title of host publicationLearning and Intelligent Optimization - 14th International Conference, LION 14, 2020, Revised Selected Papers
EditorsIlias S. Kotsireas, Panos M. Pardalos
PublisherSpringer
Pages80-95
Number of pages16
ISBN (Print)9783030535513
DOIs
Publication statusPublished - 1 Jan 2020
Externally publishedYes
Event14th International Conference on Learning and Intelligent Optimization, LION 2020 - Athens, Greece
Duration: 24 May 202028 May 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12096 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Learning and Intelligent Optimization, LION 2020
Country/TerritoryGreece
CityAthens
Period24/05/2028/05/20

Keywords

  • Algorithm configuration
  • Algorithm selection
  • Automated Machine Learning
  • Data streams
  • Stream clustering

Fingerprint

Dive into the research topics of 'confstream: automated algorithm selection and configuration of stream clustering algorithms'. Together they form a unique fingerprint.

Cite this