To deploy large language models (LLMs) in high-stakes application domains that require substantively accurate responses to open-ended prompts, we need reliable, computationally inexpensive methods for assessing the trustworthiness of the long-form responses these models generate. Existing approaches, however, often rely on claim-by-claim fact-checking, which is computationally expensive and brittle for long-form responses to open-ended prompts. In this work, we introduce semantic isotropy -- the degree of uniformity across normalized text embeddings on the unit sphere -- and use it to assess the trustworthiness of long-form LLM responses. To do so, we generate several long-form responses, embed them, and estimate their level of semantic isotropy as the angular dispersion of the embeddings on the unit sphere. We find that higher semantic isotropy -- that is, greater embedding dispersion -- reliably signals lower factual consistency across samples. Our approach requires no labeled data, no fine-tuning, and no hyperparameter selection, and can be used with open- or closed-weight embedding models. Across multiple domains, our method consistently outperforms existing approaches in predicting nonfactuality in long-form responses from only a handful of samples -- offering a practical, low-cost way to integrate trust assessment into real-world LLM workflows.
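The procedure described above -- sample several responses, embed them, and measure the angular dispersion of the normalized embeddings on the unit sphere -- can be illustrated with a minimal sketch. The dispersion statistic below (mean pairwise angle between unit embeddings) and the helper names `generate_responses` and `embed` are assumptions for illustration only, not necessarily the paper's exact estimator.

```python
import numpy as np


def semantic_isotropy(embeddings: np.ndarray) -> float:
    """Estimate the angular dispersion of response embeddings on the unit sphere.

    `embeddings` has shape (n_responses, dim). This sketch uses the mean
    pairwise angle as the dispersion statistic; the paper's estimator may differ.
    """
    # Project each embedding onto the unit sphere.
    unit = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)

    # Pairwise cosine similarities, clipped for numerical safety before arccos.
    cos = np.clip(unit @ unit.T, -1.0, 1.0)

    # Mean angle over distinct pairs: larger values mean greater dispersion,
    # i.e., higher semantic isotropy and (per the abstract) lower factual consistency.
    iu = np.triu_indices(len(unit), k=1)
    return float(np.arccos(cos[iu]).mean())


# Hypothetical usage: `generate_responses` and `embed` stand in for an LLM
# sampler and an embedding model (open- or closed-weight).
# responses = generate_responses(prompt, n=8)
# score = semantic_isotropy(embed(responses))   # higher score -> less trustworthy
```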