Interpretability is essential for user trust in real-world anomaly detection applications. However, deep learning models, despite their strong performance, often lack transparency. In this work, we study the interpretability of autoencoder-based models for audio anomaly detection by comparing a standard autoencoder (AE) with a masked autoencoder (MAE) in terms of detection performance and interpretability. We apply several attribution methods, including error maps, saliency maps, SmoothGrad, Integrated Gradients, GradSHAP, and Grad-CAM. Although the MAE shows slightly lower detection performance, it consistently provides more faithful and temporally precise explanations, suggesting better alignment with the true anomalies. To assess the relevance of the regions highlighted by each explanation method, we propose a perturbation-based faithfulness metric that replaces those regions with their reconstructions to simulate normal input. Our findings, based on experiments in a real industrial scenario, highlight the importance of incorporating interpretability into anomaly detection pipelines and show that masked training improves explanation quality without substantially compromising detection performance.
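To make the proposed metric concrete, the following is a minimal sketch of a perturbation-based faithfulness score for a PyTorch autoencoder operating on spectrogram inputs. The model, input, and attribution tensors, the `top_fraction` parameter, and the helper names are illustrative assumptions, not the paper's actual implementation.

```python
# Sketch of the perturbation-based faithfulness metric: replace the
# most-attributed time-frequency bins with the model's own reconstruction
# (a proxy for normal input) and measure the drop in anomaly score.
# All names here are hypothetical placeholders.
import torch

def anomaly_score(model: torch.nn.Module, x: torch.Tensor) -> torch.Tensor:
    """Standard AE anomaly score: mean squared reconstruction error."""
    with torch.no_grad():
        recon = model(x)
    return ((x - recon) ** 2).mean()

def faithfulness(model: torch.nn.Module,
                 x: torch.Tensor,
                 attribution: torch.Tensor,
                 top_fraction: float = 0.1) -> float:
    """Score drop after 'normalizing' the top-attributed regions.

    A larger drop means the explanation pointed at regions that actually
    drive the anomaly score, i.e. a more faithful explanation.
    """
    with torch.no_grad():
        recon = model(x)
    # Select the top-k bins by attribution magnitude.
    k = max(1, int(top_fraction * attribution.numel()))
    threshold = attribution.abs().flatten().topk(k).values.min()
    mask = attribution.abs() >= threshold
    # Replace highlighted regions with their reconstruction, simulating
    # a normal input in exactly those regions the explanation flagged.
    x_perturbed = torch.where(mask, recon, x)
    return (anomaly_score(model, x) - anomaly_score(model, x_perturbed)).item()
```

Under these assumptions, a faithful attribution concentrates on the bins that carry the anomaly's reconstruction error, so neutralizing them should collapse the score toward that of a normal input, while an unfaithful one leaves the score largely unchanged.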