Skip to main navigation Skip to search Skip to main content

Merit-guided dynamic feature selection filter for data streams

  • Jean Paul Barddal
  • , Fabrício Enembreck
  • , Heitor Murilo Gomes
  • , Albert Bifet
  • , Bernhard Pfahringer
  • Pontifícia Universidade Católica do Paraná
  • CNRS LTCI
  • University of Waikato

Research output: Contribution to journalArticlepeer-review

Abstract

Learning from ephemeral data streams has garnered the interest of both researchers and practitioners towards adaptive learning techniques. Despite the convincing results obtained thus far, most of the current research still overlooks that the relevance of features may change throughout the learning process. Scenarios where features become - or cease to be - relevant to the learning task are called feature drifting data streams, and the identification of which features are relevant becomes even more challenging when the feature space is high-dimensional. To select relevant features during the progress of data streams, we propose a merit-guided and classifier-independent dynamic feature selection algorithm named DynamIc SymmetriCal Uncertainty Selection for Streams (DISCUSS). We evaluate our proposal on both synthetic and real-world datasets and show that DISCUSS can boost kNN and Naive Bayes classifiers’ accuracy rates on high-dimensional data streams, while at the expense of limited processing time and memory space. Finally, the drawbacks of the proposed method are assessed, and possible future works on the topic are also discussed.

Original languageEnglish
Pages (from-to)227-242
Number of pages16
JournalExpert Systems with Applications
Volume116
DOIs
Publication statusPublished - 1 Feb 2019
Externally publishedYes

Keywords

  • Concept drift
  • Data stream classification
  • Feature selection

Fingerprint

Dive into the research topics of 'Merit-guided dynamic feature selection filter for data streams'. Together they form a unique fingerprint.

Cite this