Learning to rank anomalies: scalar performance criteria and maximization of rank statistics

Research output: Contribution to journal, article, peer-reviewed

Abstract

The ability to collect and store ever larger volumes of data, often unlabeled, has been accompanied by the need to process them efficiently in order to extract relevant information and, possibly, to design solutions based on it. In various situations, the vast majority of the observations exhibit the same behavior, while a small proportion deviates from it. Detecting these outlying observations (equivalently, anomalies) is now one of the major challenges for machine learning applications (e.g. fraud detection or predictive maintenance). We propose here a novel methodology for outlier/anomaly detection: learning a scoring function, defined on the feature space, that ranks the observations by degree of abnormality. The scoring function is built through maximization of an empirical performance criterion taking the form of a (two-sample) linear rank statistic. We show that bipartite ranking algorithms can thus be used to learn nearly optimal scoring functions with provable theoretical guarantees. We illustrate our methodology with numerical experiments based on open access online code.
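
To make the criterion mentioned in the abstract concrete, here is a minimal sketch of a two-sample linear rank statistic used to compare candidate scoring functions. It is not the authors' implementation: the function name `linear_rank_statistic`, the score-generating function `phi`, the toy samples, and the convention that sample A is the one whose elements should be ranked high are all assumptions made for illustration. The abstract does not specify how the two samples are built.

```python
from typing import Callable, Sequence


def linear_rank_statistic(
    scores_a: Sequence[float],
    scores_b: Sequence[float],
    phi: Callable[[float], float] = lambda u: u,
) -> float:
    """Two-sample linear rank statistic of sample A within the pooled sample.

    Each score in A is ranked among all N = len(A) + len(B) pooled scores
    (rank 1 = smallest; tied values share the average of their ranks), and
    the statistic is the sum of phi(rank / (N + 1)) over sample A.
    With phi(u) = u this is a rescaled Wilcoxon rank-sum statistic.
    """
    pooled = sorted(list(scores_a) + list(scores_b))
    n_total = len(pooled)
    avg_rank = {}
    i = 0
    while i < n_total:
        j = i
        while j < n_total and pooled[j] == pooled[i]:
            j += 1
        # 0-based positions i..j-1 hold ranks i+1..j; their mean is (i+1+j)/2.
        avg_rank[pooled[i]] = (i + 1 + j) / 2
        i = j
    return sum(phi(avg_rank[s] / (n_total + 1)) for s in scores_a)


# Hypothetical 1-D toy data (assumption): sample A should receive high scores.
sample_a = [3.0, -4.0, 5.0]
sample_b = [0.1, -0.2, 0.3, -0.5]

# Hypothetical candidate scoring functions on the feature space.
candidates = {
    "abs": lambda x: abs(x),  # distance from the origin
    "identity": lambda x: x,  # raw value
}

# Evaluate the empirical criterion for each candidate and keep the maximizer.
criterion = {
    name: linear_rank_statistic(
        [s(x) for x in sample_a], [s(x) for x in sample_b]
    )
    for name, s in candidates.items()
}
best = max(criterion, key=criterion.get)
print(criterion, best)
```

In this toy setting the search runs over two fixed candidates only. Learning a scoring function would instead maximize the same kind of criterion over a richer class, which is where the bipartite ranking algorithms mentioned in the abstract come in.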

Original language: English
Pages (from-to): 8623-8653
Number of pages: 31
Journal: Machine Learning
Volume: 113
Issue number: 11
Status: Published, 1 Dec 2024