Large Language Models (LLMs) frequently generate hallucinated content, posing significant challenges for applications where factuality is crucial. While existing hallucination detection methods typically operate at the sentence level or passage level, we propose FactSelfCheck, a novel zero-resource black-box sampling-based method that enables fine-grained fact-level detection. Our approach represents text as interpretable knowledge graphs consisting of facts in the form of triples, providing clearer insights into content factuality than traditional approaches. Through analyzing factual consistency across multiple LLM responses, we compute fine-grained hallucination scores without requiring external resources or training data. Our evaluation demonstrates that FactSelfCheck performs competitively with leading sentence-level sampling-based methods while providing more detailed and interpretable insights. Most notably, our fact-level approach significantly improves hallucination correction, achieving a 35.5% increase in factual content compared to the baseline, while sentence-level SelfCheckGPT yields only a 10.6% improvement. The granular nature of our detection enables more precise identification and correction of hallucinated content. Additionally, we contribute FavaMultiSamples, a novel dataset that addresses a gap in the field by providing the research community with a second dataset for evaluating sampling-based methods.
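The core idea of scoring facts by their consistency across sampled responses can be illustrated with a minimal sketch. This is not the paper's implementation: the triple, the naive substring-based `supports` check, and the function names are all hypothetical stand-ins (the actual method would use an LLM-based consistency judgment over knowledge-graph triples).

```python
# Illustrative sketch of fact-level hallucination scoring via sampling
# consistency. A "fact" is a (subject, relation, object) triple; its
# hallucination score is the fraction of sampled responses that fail to
# support it. "Support" here is a naive substring check, purely for
# demonstration -- an assumption, not the paper's actual verifier.

from typing import List, Tuple

Triple = Tuple[str, str, str]

def supports(sample: str, fact: Triple) -> bool:
    # Hypothetical support check: every triple element appears in the sample.
    text = sample.lower()
    return all(part.lower() in text for part in fact)

def hallucination_score(fact: Triple, samples: List[str]) -> float:
    # Fraction of sampled responses that do NOT support the fact:
    # higher values suggest the fact is less consistently reproduced.
    unsupported = sum(not supports(s, fact) for s in samples)
    return unsupported / len(samples)

fact = ("Marie Curie", "won", "Nobel Prize")
samples = [
    "Marie Curie won the Nobel Prize in Physics in 1903.",
    "Marie Curie was a pioneering physicist and chemist.",
    "She won the Nobel Prize twice; Marie Curie is unique in that regard.",
]
print(hallucination_score(fact, samples))  # 1 of 3 samples lacks support
```

Because scores attach to individual triples rather than whole sentences, a downstream correction step can target exactly the facts with high scores, which is what enables the finer-grained correction the abstract describes.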