Model Guidance via Explanations Turns Image Classifiers into Segmentation Models

Heatmaps generated on inputs of image classification networks via explainable AI methods like Grad-CAM and LRP have been observed to resemble segmentations of input images in many cases. Consequently, heatmaps have also been leveraged for achieving weakly supervised segmentation with image-level supervision. On the other hand, losses can be imposed on differentiable heatmaps, which has been shown to serve for (1)~improving heatmaps to be more human-interpretable, (2)~regularization of networks towards better generalization, (3)~training diverse ensembles of networks, and (4)~for explicitly ignoring confounding input features. Due to the latter use case, the paradigm of imposing losses on heatmaps is often referred to as "Right for the right reasons". We unify these two lines of research by investigating semi-supervised segmentation as a novel use case for the Right for the Right Reasons paradigm. First, we show formal parallels between differentiable heatmap architectures and standard encoder-decoder architectures for image segmentation. Second, we show that such differentiable heatmap architectures yield competitive results when trained with standard segmentation losses. Third, we show that such architectures allow for training with weak supervision in the form of image-level labels and small numbers of pixel-level labels, outperforming comparable encoder-decoder models. Code is available at \url{https://github.com/Kainmueller-Lab/TW-autoencoder}.

翻译：通过可解释AI方法（如Grad-CAM和LRP）在图像分类网络输入上生成的热力图，在许多情况下被观察到类似于输入图像的分割结果。因此，热力图也被用于实现基于图像级监督的弱监督分割。另一方面，可在可微分热力图上施加损失函数，这已被证明可用于：（1）改进热力图以使其更易于人类理解；（2）对网络进行正则化以提升泛化能力；（3）训练多样化的网络集成；（4）显式忽略输入中的混淆特征。基于最后一种用途，在热力图上施加损失函数的范式常被称为“为正确理由而正确”。本研究通过探索半监督分割作为“为正确理由而正确”范式的新应用场景，统一了这两个研究方向。首先，我们展示了可微分热力图架构与标准图像分割编码器-解码器架构之间的形式化对应关系。其次，我们证明此类可微分热力图架构在使用标准分割损失训练时能获得具有竞争力的结果。第三，我们证明此类架构能够以图像级标签和少量像素级标签形式的弱监督进行训练，其性能优于可比的编码器-解码器模型。代码发布于 \url{https://github.com/Kainmueller-Lab/TW-autoencoder}。

相关内容

MoDELS

关注 0

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日