Skip to main navigation Skip to search Skip to main content

Learning to rank anomalies: scalar performance criteria and maximization of rank statistics

  • ENAC-IIC-GEL
  • Institut Polytechnique de Paris

Research output: Contribution to journalArticlepeer-review

Abstract

The ability to collect and store ever more massive data, unlabeled in many cases, has been accompanied by the need to process them efficiently in order to extract relevant information and possibly design solutions based on the latter. In various situations, the vast majority of the observations exhibit the same behavior, while a small proportion deviates from it. Detecting these outlier observations (or equivalently defined as anomalies) is now one of the major challenges for machine learning applications (e.g. fraud detection or predictive maintenance). We propose here a novel methodology for outlier/anomaly detection, by learning a scoring function defined on the feature space allowing for ranking the observations by degree of abnormality. The scoring function is built through maximization of an empirical performance criterion taking the form of a (two-sample) linear rank statistic. We show that bipartite ranking algorithms can thus be used to learn nearly optimal scoring function with provable theoretical guarantees. We illustrate our methodology with numerical experiments based on open access online code.

Original languageEnglish
Pages (from-to)8623-8653
Number of pages31
JournalMachine Learning
Volume113
Issue number11
DOIs
Publication statusPublished - 1 Dec 2024

Keywords

  • Anomaly ranking
  • Bipartite ranking
  • Novelty detection
  • Two-sample linear rank statistics

Fingerprint

Dive into the research topics of 'Learning to rank anomalies: scalar performance criteria and maximization of rank statistics'. Together they form a unique fingerprint.

Cite this