Despite the growing use of deep neural networks in safety-critical decision-making, their inherent black-box nature hinders transparency and interpretability. Explainable AI (XAI) methods have thus emerged to shed light on a model's internal workings, most notably attribution methods, also known as saliency maps. Conventional attribution methods typically identify the locations -- the where -- of significant regions within an input. However, because they overlook the inherent structure of the input data, these methods often fail to interpret what these regions represent in terms of structural components (e.g., textures in images or transients in sounds). Furthermore, existing methods are usually tailored to a single data modality, limiting their generalizability. In this paper, we propose leveraging the wavelet domain as a robust mathematical foundation for attribution. Our approach, the Wavelet Attribution Method (WAM), extends existing gradient-based feature attributions into the wavelet domain, providing a unified framework for explaining classifiers across images, audio, and 3D shapes. Empirical evaluations demonstrate that WAM matches or surpasses state-of-the-art methods on faithfulness metrics across models in image, audio, and 3D explainability. Finally, we show how our method explains not only the where -- the important parts of the input -- but also the what -- the relevant patterns in terms of structural components.
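To make the core idea concrete, the following is a minimal sketch of gradient attribution in the wavelet domain for images, in the spirit of WAM but not the authors' reference implementation. It assumes the pytorch_wavelets package for a differentiable 2D discrete wavelet transform and a pretrained torchvision ResNet-50 as the classifier; the variable names (dwt, idwt, attribution) are illustrative.

```python
# Minimal sketch: gradients of a classifier's score with respect to the
# wavelet coefficients of its input (an illustration, not WAM itself).
# Assumes pytorch_wavelets, which provides a differentiable 2D DWT.
import torch
from pytorch_wavelets import DWTForward, DWTInverse
from torchvision.models import resnet50, ResNet50_Weights

model = resnet50(weights=ResNet50_Weights.DEFAULT).eval()
dwt = DWTForward(J=3, wave="haar", mode="zero")   # 3-level 2D decomposition
idwt = DWTInverse(wave="haar", mode="zero")

x = torch.rand(1, 3, 224, 224)  # stand-in for a preprocessed input image

# Decompose the input, then make the wavelet coefficients the leaf
# variables so gradients are taken with respect to them.
yl, yh = dwt(x)
yl = yl.detach().requires_grad_(True)               # low-pass (coarse) band
yh = [h.detach().requires_grad_(True) for h in yh]  # high-pass bands per level

# Reconstruct the image from its coefficients and classify it.
logits = model(idwt((yl, yh)))
logits[0, logits.argmax()].backward()

# Gradient magnitudes per coefficient: large values single out the scales
# and orientations (e.g., fine texture vs. coarse structure) that drive
# the prediction, rather than only its spatial location.
attribution = [yl.grad.abs()] + [h.grad.abs() for h in yh]
```

Because each coefficient is indexed by scale, orientation, and position, aggregating these gradient magnitudes per decomposition level recovers the what (which structural components matter), while aggregating them spatially recovers the familiar where of a saliency map.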