Adaptive Random Forests with Resampling for Imbalanced data Streams

  • Luis Eduardo Boiko Ferreira
  • , Heitor Murilo Gomes
  • , Albert Bifet
  • , Luiz S. Oliveira

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The large volume of data generated by computer networks, smartphones, wearables and a wide range of sensors, which produce real-time data, are only useful if they can be efficiently processed so that individuals can make timely decisions based on them. In this context, machine learning techniques are widely used. While it performs better than humans in such tasks, every machine learning algorithm has a certain intrinsic bias, which means they assume that the data have specific characteristics, such as having a balanced distribution between classes. As many real-world applications present imbalanced traits in their data, this topic is gaining repercussion over time. In this work, we present the Adaptive Random Forest with Resampling (ARFRE), which is a classifier designed to deal with imbalanced datasets. ARFRE resample the instances based on the current class label distribution. We show through a set of extensive experiments on seven datasets that the proposed method can considerably improve the performance of the minority class(es) while avoiding degrading the performance in the majority class. On top of that, ARFRE is more efficient regarding execution time in comparison to the standard ARF algorithm.

Original languageEnglish
Title of host publication2019 International Joint Conference on Neural Networks, IJCNN 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728119854
DOIs
Publication statusPublished - 1 Jul 2019
Event2019 International Joint Conference on Neural Networks, IJCNN 2019 - Budapest, Hungary
Duration: 14 Jul 201919 Jul 2019

Publication series

NameProceedings of the International Joint Conference on Neural Networks
Volume2019-July

Conference

Conference2019 International Joint Conference on Neural Networks, IJCNN 2019
Country/TerritoryHungary
CityBudapest
Period14/07/1919/07/19

Keywords

  • adaptive random forest
  • data streams
  • ensemble
  • imbalance
  • resampling

Fingerprint

Dive into the research topics of 'Adaptive Random Forests with Resampling for Imbalanced data Streams'. Together they form a unique fingerprint.

Cite this