A Survey on Explainable Anomaly Detection

Published 13 Oct 2022 in cs.LG | (2210.06959v2)

Abstract: In the past two decades, most research on anomaly detection has focused on improving the accuracy of the detection, while largely ignoring the explainability of the corresponding methods and thus leaving the explanation of outcomes to practitioners. As anomaly detection algorithms are increasingly used in safety-critical domains, providing explanations for the high-stakes decisions made in those domains has become an ethical and regulatory requirement. Therefore, this work provides a comprehensive and structured survey on state-of-the-art explainable anomaly detection techniques. We propose a taxonomy based on the main aspects that characterize each explainable anomaly detection technique, aiming to help practitioners and researchers find the explainable anomaly detection method that best suits their needs.
