LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tools and Self-Explanations

Interpretability tools that offer explanations in the form of a dialogue have demonstrated their efficacy in enhancing users' understanding (Slack et al., 2023; Shen et al., 2023), as one-off explanations may fall short in providing sufficient information to the user. Current solutions for dialogue-based explanations, however, often require external tools and modules and are not easily transferable to tasks they were not designed for. With LLMCheckup, we present an easily accessible tool that allows users to chat with any state-of-the-art large language model (LLM) about its behavior. We enable LLMs to generate explanations and perform user intent recognition without fine-tuning, by connecting them with a broad spectrum of Explainable AI (XAI) methods, including white-box explainability tools such as feature attributions, and self-explanations (e.g., for rationale generation). LLM-based (self-)explanations are presented as an interactive dialogue that supports follow-up questions and generates suggestions. LLMCheckupprovides tutorials for operations available in the system, catering to individuals with varying levels of expertise in XAI and supporting multiple input modalities. We introduce a new parsing strategy that substantially enhances the user intent recognition accuracy of the LLM. Finally, we showcase LLMCheckup for the tasks of fact checking and commonsense question answering.

翻译：以对话形式提供解释的可解释性工具已被证明能有效提升用户理解（Slack等，2023；Shen等，2023），因为一次性解释可能无法为用户提供足够信息。然而，现有基于对话的解释方案通常依赖外部工具和模块，且难以迁移至非预设任务。通过LLMCheckup，我们提出一种便捷工具，允许用户与任何最先进的大语言模型（LLM）就其行为进行对话。通过将LLM与广泛的可解释人工智能（XAI）方法（包括特征归因等白盒可解释性工具和用于原理生成的自我解释）相连接，我们使LLM无需微调即可生成解释并执行用户意图识别。基于LLM的（自）解释以支持追问和生成建议的交互式对话形式呈现。LLMCheckup提供系统操作教程，面向XAI领域不同专业水平的用户，并支持多模态输入。我们提出一种新的解析策略，显著提升了LLM的用户意图识别准确率。最后，我们展示了LLMCheckup在事实核查和常识问答任务中的应用。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日