Fast and lightweight binary and multi-branch Hoeffding Tree Regressors

Saulo Martiello Mastelini, Jacob Montiel, Heitor Murilo Gomes, Albert Bifet, Bernhard Pfahringer, Andre C.P.L.F. De Carvalho

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Incremental Hoeffding Tree Regressors (HTR) are powerful non-linear online learning tools. However, the commonly used strategy to build such structures limits their applicability to real-time scenarios. In this paper, we expand and evaluate Quantization Observer (QO), a feature discretization-based tool to speed up incremental regression tree construction and save memory resources. We enhance the original QO proposal to create multi-branch trees when dealing with numerical attributes, creating a mix of interval and binary splits rather than binary splits only. We evaluate the multi-branch and strictly binary QO-based HTRs against other tree-building strategies in an extensive experimental setup of 15 data streams. In general, the QO-based HTRs are as accurate as traditional HTRs, incurring one-third of training time at only a fraction of the memory resource usage. The obtained numerical multi-branch HTRs are shallower than the strictly binary ones, significantly faster to train, and they keep predictive performance similar to the traditional incremental trees.

Original languageEnglish
Title of host publicationProceedings - 21st IEEE International Conference on Data Mining Workshops, ICDMW 2021
EditorsBing Xue, Mykola Pechenizkiy, Yun Sing Koh
PublisherIEEE Computer Society
Pages380-388
Number of pages9
ISBN (Electronic)9781665424271
DOIs
Publication statusPublished - 1 Jan 2021
Externally publishedYes
Event21st IEEE International Conference on Data Mining Workshops, ICDMW 2021 - Virtual, Online, New Zealand
Duration: 7 Dec 202110 Dec 2021

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2021-December
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference21st IEEE International Conference on Data Mining Workshops, ICDMW 2021
Country/TerritoryNew Zealand
CityVirtual, Online
Period7/12/2110/12/21

Keywords

  • Hoeffding tree regressor
  • computational resource savings
  • incremental learning
  • online learning

Fingerprint

Dive into the research topics of 'Fast and lightweight binary and multi-branch Hoeffding Tree Regressors'. Together they form a unique fingerprint.

Cite this