HALO: An Ontology for Representing Hallucinations in Generative Models

Recent progress in generative AI, including large language models (LLMs) like ChatGPT, has opened up significant opportunities in fields ranging from natural language processing to knowledge discovery and data mining. However, there is also a growing awareness that the models can be prone to problems such as making information up or `hallucinations', and faulty reasoning on seemingly simple problems. Because of the popularity of models like ChatGPT, both academic scholars and citizen scientists have documented hallucinations of several different types and severity. Despite this body of work, a formal model for describing and representing these hallucinations (with relevant meta-data) at a fine-grained level, is still lacking. In this paper, we address this gap by presenting the Hallucination Ontology or HALO, a formal, extensible ontology written in OWL that currently offers support for six different types of hallucinations known to arise in LLMs, along with support for provenance and experimental metadata. We also collect and publish a dataset containing hallucinations that we inductively gathered across multiple independent Web sources, and show that HALO can be successfully used to model this dataset and answer competency questions.

翻译：摘要：生成式人工智能的最新进展，包括ChatGPT等大型语言模型，已在自然语言处理、知识发现与数据挖掘等领域开辟了重大机遇。然而，人们也日益认识到这些模型存在易产生问题，例如捏造信息或"幻觉"，以及在看似简单的问题上出现错误推理。由于ChatGPT等模型的普及，学术学者和公民科学家都已记录下多种不同类型和严重程度的幻觉现象。尽管已有大量相关研究，但目前仍缺乏一种能够细粒度描述和表示这些幻觉（并附带相关元数据）的正式模型。本文通过提出"幻觉本体"（HALO）来填补这一空白——这是一种用OWL编写的可扩展形式化本体，当前支持大型语言模型中已知的六种幻觉类型，并具备来源和实验元数据支持。我们还收集并发布了一个数据集，其中包含从多个独立网络来源归纳整理的幻觉实例，并证明HALO可成功用于建模该数据集并回答能力问题。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日