Multi-Level Explanations for Generative Language Models

Lucas Monteiro Paes,Dennis Wei,Hyo Jin Do,Hendrik Strobelt,Ronny Luss,Amit Dhurandhar,Manish Nagireddy,Karthikeyan Natesan Ramamurthy,Prasanna Sattigeri,Werner Geyer,Soumya Ghosh

Perturbation-based explanation methods such as LIME and SHAP are commonly applied to text classification. This work focuses on their extension to generative language models. To address the challenges of text as output and long text inputs, we propose a general framework called MExGen that can be instantiated with different attribution algorithms. To handle text output, we introduce the notion of scalarizers for mapping text to real numbers and investigate multiple possibilities. To handle long inputs, we take a multi-level approach, proceeding from coarser levels of granularity to finer ones, and focus on algorithms with linear scaling in model queries. We conduct a systematic evaluation, both automated and human, of perturbation-based attribution methods for summarization and context-grounded question answering. The results show that our framework can provide more locally faithful explanations of generated outputs.

翻译：扰动式解释方法（如LIME和SHAP）通常应用于文本分类任务。本研究聚焦于将其扩展至生成式语言模型。为应对文本输出和长文本输入带来的挑战，我们提出通用框架MExGen，该框架可通过不同归因算法进行实例化。针对文本输出问题，我们引入"标量器"概念以实现文本到实数的映射，并探究多种实现方案。为处理长序列输入，我们采用多层级策略，从粗粒度逐步过渡到细粒度，并重点研究模型查询次数呈线性扩展的算法。我们针对摘要生成和基于上下文的问答任务，开展了包含自动化评估与人工评估的系统性扰动归因方法评测。实验结果表明，本框架能为生成输出提供更具局部忠实性的解释。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日