Analyzing and Repairing Concept Drift Adaptation in Data Stream Classification

  • Ben Halstead
  • , Yun Sing Koh
  • , Patricia Riddle
  • , Russel Pears
  • , Mykola Pechenizkiy
  • , Albert Bifet
  • , Gustavo Olivares
  • , Guy Coulson

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Data collected over time often exhibit changes in distribution, or concept drift, caused by changes in hidden context relevant to the classification task, e.g. weather conditions. Adaptive learning methods are able to retain performance in changing conditions by explicitly detecting concept drift and changing the classifier used to make predictions. However, in real-world conditions, existing methods often select classifiers which poorly represent current data due to adaptation errors, where change in context is misidentified. We propose the AiRStream system, which uses a novel repair algorithm to identify and correct adaptation errors. We identify errors by periodically testing the performance of inactive classifiers. If an error is identified, a backtracking procedure repairs training done under the misidentified context. AiRStream achieves higher accuracy compared to baseline methods and selects classifiers which better match changes in context. A case study on a real-world air quality inference task shows that AiRStream is able to build a robust model of environmental conditions, allowing the adaptions made to concept drift to be analysed and related to changes in weather.

Original languageEnglish
Title of host publication2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781665420990
DOIs
Publication statusPublished - 1 Jan 2021
Event8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021 - Virtual, Online, Portugal
Duration: 6 Oct 20219 Oct 2021

Publication series

Name2021 IEEE 8th International Conference on Data Science and Advanced Analytics, DSAA 2021

Conference

Conference8th IEEE International Conference on Data Science and Advanced Analytics, DSAA 2021
Country/TerritoryPortugal
CityVirtual, Online
Period6/10/219/10/21

Fingerprint

Dive into the research topics of 'Analyzing and Repairing Concept Drift Adaptation in Data Stream Classification'. Together they form a unique fingerprint.

Cite this