CD-MOA: Change detection framework for massive online analysis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Analysis of data from networked digital information systems such as mobile devices, remote sensors, and streaming applications, needs to deal with two challenges: the size of data and the capacity to be adaptive to changes in concept in real-time. Many approaches meet the challenge by using an explicit change detector alongside a classification algorithm and then evaluate performance using classification accuracy. However, there is an unexpected connection between change detectors and classification methods that needs to be acknowledged. The phenomenon has been observed previously, connecting high classification performance with high false positive rates. The implication is that we need to be careful to evaluate systems against intended outcomes-high classification rates, low false alarm rates, compromises between the two and so forth. This paper proposes a new experimental framework for evaluating change detection methods against intended outcomes. The framework is general in the sense that it can be used with other data mining tasks such as frequent item and pattern mining, clustering etc. Included in the framework is a new measure of performance of a change detector that monitors the compromise between fast detection and false alarms. Using this new experimental framework we conduct an evaluation study on synthetic and real-world datasets to show that classification performance is indeed a poor proxy for change detection performance and provide further evidence that classification performance is correlated strongly with the use of change detectors that produce high false positive rates.

Original languageEnglish
Title of host publicationAdvances in Intelligent Data Analysis XII - 12th International Symposium, IDA 2013, Proceedings
Pages92-103
Number of pages12
DOIs
Publication statusPublished - 11 Nov 2013
Externally publishedYes
Event12th International Symposium on Intelligent Data Analysis, IDA 2013 - London, United Kingdom
Duration: 17 Oct 201319 Oct 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8207 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th International Symposium on Intelligent Data Analysis, IDA 2013
Country/TerritoryUnited Kingdom
CityLondon
Period17/10/1319/10/13

Keywords

  • data streams
  • dynamic
  • evolving
  • incremental
  • online

Fingerprint

Dive into the research topics of 'CD-MOA: Change detection framework for massive online analysis'. Together they form a unique fingerprint.

Cite this