LAP: An Attention-Based Module for Concept Based Self-Interpretation and Knowledge Injection in Convolutional Neural Networks

Despite the state-of-the-art performance of deep convolutional neural networks, they are susceptible to bias and malfunction in unseen situations. Moreover, the complex computation behind their reasoning is not human-understandable to develop trust. External explainer methods have tried to interpret network decisions in a human-understandable way, but they are accused of fallacies due to their assumptions and simplifications. On the other side, the inherent self-interpretability of models, while being more robust to the mentioned fallacies, cannot be applied to the already trained models. In this work, we propose a new attention-based pooling layer, called Local Attention Pooling (LAP), that accomplishes self-interpretability and the possibility for knowledge injection without performance loss. The module is easily pluggable into any convolutional neural network, even the already trained ones. We have defined a weakly supervised training scheme to learn the distinguishing features in decision-making without depending on experts' annotations. We verified our claims by evaluating several LAP-extended models on two datasets, including ImageNet. The proposed framework offers more valid human-understandable and faithful-to-the-model interpretations than the commonly used white-box explainer methods.

翻译：尽管深度卷积神经网络具有最先进的性能，但它们在未见场景中容易产生偏差和故障。此外，其推理背后的复杂计算难以被人类理解以建立信任。外部解释方法试图以人类可理解的方式解释网络决策，但由于其假设和简化而受到质疑。另一方面，模型固有的自解释性虽然对上述谬误更具鲁棒性，但无法应用于已训练的模型。本文提出一种新的基于注意力的池化层——局部注意力池化（LAP），该模块在不损失性能的情况下实现自解释性并支持知识注入。该模块可轻松嵌入任何卷积神经网络，甚至包括已训练的模型。我们定义了一种弱监督训练方案，用于学习决策过程中的区分性特征，而无需依赖专家标注。通过在包括ImageNet在内的两个数据集上评估多个LAP扩展模型，我们验证了上述主张。与常用的白盒解释方法相比，所提框架提供了更有效、更符合人类理解且忠于模型本体的解释。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日