Leveraging bagging for evolving data streams

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Bagging, boosting and Random Forests are classical ensemble methods used to improve the performance of single classifiers. They obtain superior performance by increasing the accuracy and diversity of the single classifiers. Attempts have been made to reproduce these methods in the more challenging context of evolving data streams. In this paper, we propose a new variant of bagging, called leveraging bagging. This method combines the simplicity of bagging with adding more randomization to the input, and output of the classifiers. We test our method by performing an evaluation study on synthetic and real-world datasets comprising up to ten million examples.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2010, Proceedings
PublisherSpringer Verlag
Pages135-150
Number of pages16
EditionPART 1
ISBN (Print)364215879X, 9783642158797
DOIs
Publication statusPublished - 1 Jan 2010
Externally publishedYes
EventEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2010 - Barcelona, Spain
Duration: 20 Sept 201024 Sept 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume6321 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2010
Country/TerritorySpain
CityBarcelona
Period20/09/1024/09/10

Fingerprint

Dive into the research topics of 'Leveraging bagging for evolving data streams'. Together they form a unique fingerprint.

Cite this