Learning from time-changing data with adaptive windowing

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present a new approach for dealing with distribution change and concept drift when learning from data sequences that may vary with time. We use sliding windows whose size, instead of being fixed a priori, is recomputed online according to the rate of change observed from the data in the window itself. This delivers the user or programmer from having to guess a time-scale for change. Contrary to many related works, we provide rigorous guarantees of performance, as bounds on the rates of false positives and false negatives. Using ideas from data stream algorithmics, we develop a time-and memory-efficient version of this algorithm, called ADWIN2. We show how to combine ADWIN2 with the Naïve Bayes (NB) predictor, in two ways: one, using it to monitor the error rate of the current model and declare when revision is necessary and, two, putting it inside the NB predictor to maintain up-to-date estimations of conditional probabilities in the data. We test our approach using synthetic and real data streams and compare them to both fixed-size and variable-size window strategies with good results.

Original languageEnglish
Title of host publicationProceedings of the 7th SIAM International Conference on Data Mining
PublisherSociety for Industrial and Applied Mathematics Publications
Pages443-448
Number of pages6
ISBN (Print)9780898716306
DOIs
Publication statusPublished - 1 Jan 2007
Externally publishedYes
Event7th SIAM International Conference on Data Mining - Minneapolis, MN, United States
Duration: 26 Apr 200728 Apr 2007

Publication series

NameProceedings of the 7th SIAM International Conference on Data Mining

Conference

Conference7th SIAM International Conference on Data Mining
Country/TerritoryUnited States
CityMinneapolis, MN
Period26/04/0728/04/07

Keywords

  • Concept and distribution drift
  • Data streams
  • Naïve bayes
  • Time-changing data

Fingerprint

Dive into the research topics of 'Learning from time-changing data with adaptive windowing'. Together they form a unique fingerprint.

Cite this