Skip to main navigation Skip to search Skip to main content

New ensemble methods for evolving data streams

  • Albert Bifet
  • , Geoff Holmes
  • , Bernhard Pfahringer
  • , Richard Kirkby
  • , Ricard Gavaldà
  • Universidad Politecnica de Catalunia
  • University of Waikato

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Advanced analysis of data streams is quickly becoming a key area of data mining research as the number of applications demanding such processing increases. Online mining when such data streams evolve over time, that is when concepts drift or change completely, is becoming one of the core issues. When tackling non-stationary concepts, ensembles of classifiers have several advantages over single classifier methods: they are easy to scale and parallelize, they can adapt to change quickly by pruning under-performing parts of the ensemble, and they therefore usually also generate more accurate concept descriptions. This paper proposes a new experimental data stream framework for studying concept drift, and two new variants of Bagging: ADWIN Bagging and Adaptive-Size Hoefinding Tree (ASHT) Bagging. Using the new experimental framework, an evaluation study on synthetic and real-world datasets comprising up to ten million examples shows that the new ensemble methods perform very well compared to several known methods.

Original languageEnglish
Title of host publicationKDD '09
Subtitle of host publicationProceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Pages139-147
Number of pages9
DOIs
Publication statusPublished - 9 Nov 2009
Externally publishedYes
Event15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '09 - Paris, France
Duration: 28 Jun 20091 Jul 2009

Publication series

NameProceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Conference

Conference15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '09
Country/TerritoryFrance
CityParis
Period28/06/091/07/09

Keywords

  • Concept drift
  • Data streams
  • Decision tree
  • Ensemble methods

Fingerprint

Dive into the research topics of 'New ensemble methods for evolving data streams'. Together they form a unique fingerprint.

Cite this