Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models

December 22, 2023

Priyesh Vakharia, Devavrat Joshi, Meenal Chavan, Dhananjay Sonawane, Bhrigu Garg, Parsa Mazaheri, Ian Lane

From arXiv. All authors contributed equally to this work.

Large Language Models (LLMs) are adept at text manipulation tasks such as machine translation and text summarization. However, these models can also be prone to hallucination, which can undermine the faithfulness of the answers they provide. Recent work on combating hallucinations in LLMs focuses on identifying hallucinated sentences and categorizing the different ways in which models hallucinate. This paper takes a deep dive into LLM behavior with respect to hallucinations, defines a token-level approach to identifying different kinds of hallucinations, and uses this token-level tagging to improve the interpretability and faithfulness of LLMs in dialogue summarization tasks. In doing so, the paper presents a new, enhanced dataset and a new training paradigm.
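
To make the idea of token-level tagging concrete, the following is a minimal sketch of what per-token labels on a dialogue summary could look like. It is not the paper's method: it swaps in a naive lexical-overlap heuristic (a summary token counts as unsupported if it never appears in the dialogue), and the function names, the SUPPORTED/UNSUPPORTED label set, and the example dialogue are all hypothetical choices made for illustration.

```python
import re
from typing import List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase word tokenizer; a stand-in for a real model tokenizer."""
    return re.findall(r"[a-z0-9']+", text.lower())


def tag_summary_tokens(dialogue: str, summary: str) -> List[Tuple[str, str]]:
    """Label each summary token by naive lexical overlap with the dialogue.

    Assumption: a token absent from the dialogue is treated as a possible
    hallucination. This is only an illustrative heuristic.
    """
    source_vocab = set(tokenize(dialogue))
    return [
        (tok, "SUPPORTED" if tok in source_vocab else "UNSUPPORTED")
        for tok in tokenize(summary)
    ]


# Hypothetical example: the summary changes "Friday" to "Monday".
dialogue = "Anna: I will bring the cake on Friday. Ben: Great, I will bring juice."
summary = "Anna will bring the cake on Monday."

tags = tag_summary_tokens(dialogue, summary)
flagged = [tok for tok, label in tags if label == "UNSUPPORTED"]
print(flagged)  # ['monday']
```

A per-token label sequence like this is what lets a reader see which words in a summary to distrust, rather than only learning that the sentence as a whole is unreliable. A heuristic this simple would miss paraphrases and flag legitimate rewordings, which is one reason a learned tagger is needed in practice.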
