Explanations for Convolutional Neural Networks (CNNs) based on the relevance of input pixels may be too unspecific to evaluate which input features impact model decisions and how. Especially in complex real-world domains such as biomedicine, the presence of specific concepts (e.g., a certain type of cell) and of relations between concepts (e.g., one cell type being adjacent to another) may discriminate between classes (e.g., different types of tissue). Pixel relevance is not expressive enough to convey this type of information. As a consequence, model evaluation is limited, and relevant aspects that are present in the data and influence the model's decisions may be overlooked. This work presents a novel method to explain and evaluate CNN models using a concept- and relation-based explainer (CoReX). It explains the predictive behavior of a model on a set of images by masking (ir-)relevant concepts out of the decision-making process and by constraining relations in a learned interpretable surrogate model. We test our approach on several image data sets and CNN architectures. The results show that CoReX explanations are faithful to the CNN model in terms of predictive outcomes. We further demonstrate that CoReX is a suitable tool for evaluating CNNs, as it supports the identification and re-classification of incorrect or ambiguous classifications.
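The core intuition behind concept masking can be illustrated with a minimal sketch: occlude the pixels belonging to a candidate concept and measure how the model's score changes. This is not the CoReX algorithm itself; the function `concept_masking_effect` and the toy mean-intensity model are hypothetical stand-ins, assuming a NumPy image array and a boolean concept mask.

```python
import numpy as np

def concept_masking_effect(model, image, concept_mask, baseline=0.0):
    """Hypothetical sketch (not the actual CoReX method): measure how
    occluding a concept's pixels changes a model's class score."""
    masked = image.copy()
    masked[concept_mask] = baseline  # mask out pixels belonging to the concept
    return model(image) - model(masked)

# Toy stand-in model: class score is simply the mean pixel intensity.
toy_model = lambda img: img.mean()

img = np.ones((4, 4))
mask = np.zeros((4, 4), dtype=bool)
mask[:2, :] = True  # assume the "concept" occupies the top half of the image

delta = concept_masking_effect(toy_model, img, mask)
print(delta)  # score drop attributable to the masked concept -> 0.5
```

A large score drop suggests the masked concept is relevant to the prediction; a near-zero drop suggests it is not, which is the kind of faithfulness signal the abstract refers to.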