Skip to main navigation Skip to search Skip to main content

STREamRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams

  • Stefan Nesic
  • , Andrian Putina
  • , Maroua Bahri
  • , Alexis Huet
  • , Jose Manuel Navarro
  • , Dario Rossi
  • , Mauro Sozio

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present STREAMRHF, an unsupervised anomaly detection algorithm for data streams. Our algorithm builds on some of the ideas of Random Histogram Forest (RHF) [1], a state-of-the-art algorithm for batch unsupervised anomaly detection. STREAMRHF constructs a forest of decision trees, where feature splits are determined according to the kurtosis score of every feature. It irrevocably assigns an anomaly score to data points, as soon as they arrive, by means of an incremental computation of its random trees and the kurtosis scores of the features. This allows efficient online scoring and concept drift detection altogether. Our approach is tree-based which boasts several appealing properties, such as explainability of the results [2]. We conduct an extensive experimental evaluation on multiple datasets from different real-world applications. Our evaluation shows that our streaming algorithm achieves comparable average precision to RHF while outperforming state-of-the-art streaming approaches for unsupervised anomaly detection with furthermore limited computational complexity.

Original languageEnglish
Title of host publication2022 IEEE/ACS 19th International Conference on Computer Systems and Applications, AICCSA 2022 - Proceedings
PublisherIEEE Computer Society
ISBN (Electronic)9798350310085
DOIs
Publication statusPublished - 1 Jan 2022
Externally publishedYes
Event19th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2022 - Abu Dhabi, United Arab Emirates
Duration: 5 Dec 20227 Dec 2022

Publication series

NameProceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
Volume2022-December
ISSN (Print)2161-5322
ISSN (Electronic)2161-5330

Conference

Conference19th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2022
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period5/12/227/12/22

Keywords

  • Anomaly detection
  • Data streams
  • Random histogram
  • Unsupervised learning

Fingerprint

Dive into the research topics of 'STREamRHF: Tree-Based Unsupervised Anomaly Detection for Data Streams'. Together they form a unique fingerprint.

Cite this