Using Explanations to Guide Models

Deep neural networks are highly performant, but might base their decision on spurious or background features that co-occur with certain classes, which can hurt generalization. To mitigate this issue, the usage of 'model guidance' has gained popularity recently: for this, models are guided to be "right for the right reasons" by regularizing the models' explanations to highlight the right features. Experimental validation of these approaches has thus far however been limited to relatively simple and / or synthetic datasets. To gain a better understanding of which model-guiding approaches actually transfer to more challenging real-world datasets, in this work we conduct an in-depth evaluation across various loss functions, attribution methods, models, and 'guidance depths' on the PASCAL VOC 2007 and MS COCO 2014 datasets, and show that model guidance can sometimes even improve model performance. In this context, we further propose a novel energy loss, show its effectiveness in directing the model to focus on object features. We also show that these gains can be achieved even with a small fraction (e.g. 1%) of bounding box annotations, highlighting the cost effectiveness of this approach. Lastly, we show that this approach can also improve generalization under distribution shifts. Code will be made available.

翻译：深度神经网络虽然性能卓越，但可能基于与某些类别共存的虚假特征或背景特征做出决策，这会损害模型的泛化能力。为解决这一问题，"模型指导"方法近期备受关注：通过正则化模型解释以突出正确特征，从而引导模型"以正确理由得出正确结论"。然而，目前对这些方法的实验验证仍局限于相对简单或合成的数据集。为深入探究哪些模型指导方法能真正迁移至更具挑战性的真实数据集，本研究在PASCAL VOC 2007和MS COCO 2014数据集上，针对多种损失函数、归因方法、模型及"指导深度"进行了全面评估，结果表明模型指导有时甚至能提升模型性能。在此背景下，我们进一步提出一种新型能量损失函数，并证明其在引导模型聚焦物体特征方面的有效性。实验还表明，即使仅使用少量边界框标注（如1%），也能获得性能提升，凸显了该方法的经济性。最后，我们证明该方法在分布偏移下也能改善泛化能力。相关代码将公开。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【ICML2022】基于随机注意力机制的可解释和广义图学习

专知会员服务

33+阅读 · 2022年8月7日

【ACL2022】解释生成的多尺度分布深度变分自编码器, Multi-Scale Distribution Deep Variational Autoencoder for Explanation Generation

专知会员服务

12+阅读 · 2022年3月24日

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

专知会员服务

136+阅读 · 2020年5月1日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日