Droplet Ensemble Learning on Drifting Data Streams

Pierre Xavier Loeffel, Albert Bifet, Christophe Marsala, Marcin Detyniecki

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Ensemble learning methods for evolving data streams are extremely powerful learning methods since they combine the predictions of a set of classifiers, to improve the performance of the best single classifier inside the ensemble. In this paper we introduce the Droplet Ensemble Algorithm (DEA), a new method for learning on data streams subject to concept drifts which combines ensemble and instance based learning. Contrarily to state of the art ensemble methods which select the base learners according to their performances on recent observations, DEA dynamically selects the subset of base learners which is the best suited for the region of the feature space where the latest observation was received. Experiments on 25 datasets (most of which being commonly used as benchmark in the literature) reproducing different type of drifts show that this new method achieves excellent results on accuracy and ranking against SAM KNN [1], all of its base learners and a majority vote algorithm using the same base learners.

Original languageEnglish
Title of host publicationAdvances in Intelligent Data Analysis XVI - 16th International Symposium, IDA 2017, Proceedings
EditorsNiall Adams, Allan Tucker, David Weston
PublisherSpringer Verlag
Pages210-222
Number of pages13
ISBN (Print)9783319687643
DOIs
Publication statusPublished - 1 Jan 2017
Externally publishedYes
Event16th International Symposium on Intelligent Data Analysis, IDA 2017 - London, United Kingdom
Duration: 26 Oct 201728 Oct 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10584 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Symposium on Intelligent Data Analysis, IDA 2017
Country/TerritoryUnited Kingdom
CityLondon
Period26/10/1728/10/17

Keywords

  • Concept drift
  • Data streams
  • Ensemble learning
  • Online-learning
  • Supervised learning

Fingerprint

Dive into the research topics of 'Droplet Ensemble Learning on Drifting Data Streams'. Together they form a unique fingerprint.

Cite this