Anomaly detection using data depth: multivariate case

Research output: Contribution to journalReview articlepeer-review

Abstract

Anomaly detection is a branch of data analysis and machine learning which aims at identifying observations that exhibit abnormal behavior. Be it measurement errors, disease development, severe weather, production quality default(s) (items) or failed equipment, financial frauds or crisis events, their on-time identification, isolation, and explanation constitute an important task in almost any branch of science and industry. By providing a robust ordering, data depth—statistical function that measures belongingness of any point of the space to a data set—becomes a particularly useful tool for detection of anomalies. Already known for its theoretical properties, data depth has undergone substantial computational developments in the last decade and particularly recent years, which has made it applicable for contemporary-sized problems of data analysis and machine learning. In this article, data depth is studied as an efficient anomaly detection tool, assigning abnormality labels to observations with lower depth values, in a multivariate setting. Practical questions of necessity and reasonability of invariances and shape of the depth function, its robustness and computational complexity, choice of the threshold are discussed. Illustrations include use cases that underline advantageous behavior of data depth in various settings.

Original languageEnglish
Pages (from-to)5171-5196
Number of pages26
JournalInternational Journal of Data Science and Analytics
Volume20
Issue number6
DOIs
Publication statusPublished - 1 Nov 2025

Keywords

  • Affine invariance
  • Anomaly detection
  • Computational statistics
  • Data analysis
  • Data depth
  • Halfspace depth
  • Projection depth
  • Robustness
  • Visualization

Fingerprint

Dive into the research topics of 'Anomaly detection using data depth: multivariate case'. Together they form a unique fingerprint.

Cite this