TY - GEN
T1 - Explainable anomaly detection on high-dimensional time series data
AU - Rad, Bijan
AU - Song, Fei
AU - Jacob, Vincent
AU - Diao, Yanlei
N1 - Publisher Copyright:
© 2021 Owner/Author.
PY - 2021/6/28
Y1 - 2021/6/28
N2 - As enterprise information systems are collecting event streams from various sources, the ability of a system to automatically detect anomalous events and further provide human-readable explanations is of paramount importance. In this paper, we present an approach to integrated anomaly detection (AD) and explanation discovery (ED), which is able to leverage state-of-the-art Deep Learning (DL) techniques for anomaly detection, while being able to recover human-readable explanations for detected anomalies. At the core of the framework is a new human-interpretable dimensionality reduction (HIDR) method that not only reduces the dimensionality of the data, but also maintains a meaningful mapping from the original features to the transformed low-dimensional features. Such transformed features can be fed into any DL technique designed for anomaly detection, and the feature mapping will be used to recover human-readable explanations through a suite of new feature selection and explanation discovery methods. Evaluation using a recent explainable anomaly detection benchmark demonstrates the efficiency and effectiveness of HIDR for AD, and the result that while all three recent ED techniques failed to generate quality explanations on high-dimensional data, our HIDR-based ED framework can enable them to generate explanations with dramatic improvements in the quality of explanations and computational efficiency.
AB - As enterprise information systems are collecting event streams from various sources, the ability of a system to automatically detect anomalous events and further provide human-readable explanations is of paramount importance. In this paper, we present an approach to integrated anomaly detection (AD) and explanation discovery (ED), which is able to leverage state-of-the-art Deep Learning (DL) techniques for anomaly detection, while being able to recover human-readable explanations for detected anomalies. At the core of the framework is a new human-interpretable dimensionality reduction (HIDR) method that not only reduces the dimensionality of the data, but also maintains a meaningful mapping from the original features to the transformed low-dimensional features. Such transformed features can be fed into any DL technique designed for anomaly detection, and the feature mapping will be used to recover human-readable explanations through a suite of new feature selection and explanation discovery methods. Evaluation using a recent explainable anomaly detection benchmark demonstrates the efficiency and effectiveness of HIDR for AD, and the result that while all three recent ED techniques failed to generate quality explanations on high-dimensional data, our HIDR-based ED framework can enable them to generate explanations with dramatic improvements in the quality of explanations and computational efficiency.
KW - anomaly detection
KW - dimensionality reduction
KW - explanation discovery
KW - neural networks
KW - time series data analysis
U2 - 10.1145/3465480.3468292
DO - 10.1145/3465480.3468292
M3 - Conference contribution
AN - SCOPUS:85110329291
T3 - DEBS 2021 - Proceedings of the 15th ACM International Conference on Distributed and Event-Based Systems
SP - 142
EP - 147
BT - DEBS 2021 - Proceedings of the 15th ACM International Conference on Distributed and Event-Based Systems
A2 - Margara, Alessandro
A2 - Della Valle, Emanuele
A2 - Artikis, Alexander
A2 - Tatbul, Nesime
A2 - Parzyjegla, Helge
PB - Association for Computing Machinery, Inc
T2 - 15th ACM International Conference on Distributed and Event-Based Systems, DEBS 2021
Y2 - 28 June 2021 through 2 July 2021
ER -