Despite the remarkable multimodal capabilities of Large Vision-Language Models (LVLMs), discrepancies often arise between visual inputs and textual outputs, a phenomenon we term visual hallucination. This reliability gap poses substantial risks in safety-critical Artificial Intelligence (AI) applications, necessitating a comprehensive evaluation benchmark and effective detection methods. First, we observe that existing visual-centric hallucination benchmarks mainly assess LVLMs from a perception perspective, overlooking hallucinations arising from advanced reasoning capabilities. We therefore develop the Perception-Reasoning Evaluation Hallucination (PRE-HAL) dataset, which enables systematic evaluation of both the perception and reasoning capabilities of LVLMs across multiple visual semantics, such as instances, scenes, and relations. A comprehensive evaluation with this new benchmark exposes further visual vulnerabilities, particularly in the more challenging task of relation reasoning. To address this issue, we propose, to the best of our knowledge, the first Dempster-Shafer theory (DST)-based visual hallucination detection method for LVLMs, built on uncertainty estimation. The method efficiently captures the degree of conflict in high-level features at the model inference phase; specifically, it employs simple mass functions to mitigate the computational complexity of evidence combination over power sets. We conduct an extensive evaluation of state-of-the-art LVLMs, LLaVA-v1.5, mPLUG-Owl2, and mPLUG-Owl3, on the new PRE-HAL benchmark. Experimental results show that our method outperforms five baseline uncertainty metrics, achieving average AUROC improvements of 4%, 10%, and 7% on the three LVLMs. Our code is available at https://github.com/HT86159/Evidential-Conflict.
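The simple-mass-function idea mentioned above can be illustrated with a toy sketch (this is an assumption-laden illustration, not the paper's implementation: the two-source setup, function names, and confidence values are hypothetical). Each "simple" mass function commits mass s to a single hypothesis and the remainder 1 - s to the whole frame Θ (total ignorance), so Dempster's rule never has to enumerate the full power set, and the conflict mass K produced when two sources disagree serves as the uncertainty signal.

```python
def combine_simple(m1, m2):
    """Combine two simple mass functions by Dempster's rule.

    Each argument is a tuple (focal, s): mass s on the singleton
    {focal}, mass 1 - s on the frame Θ. Returns (masses, K), where
    masses maps focal elements (or "Θ") to combined mass and K is
    the degree of conflict.
    """
    (a, s1), (b, s2) = m1, m2
    if a == b:
        # Agreeing sources: all pairwise intersections are non-empty,
        # so no mass is lost to conflict.
        K = 0.0
        masses = {a: s1 * s2 + s1 * (1 - s2) + (1 - s1) * s2,
                  "Θ": (1 - s1) * (1 - s2)}
    else:
        # {a} ∩ {b} = ∅ contributes conflict mass K = s1 * s2;
        # the remaining mass is renormalized by 1 - K.
        K = s1 * s2
        norm = 1.0 - K
        masses = {a: s1 * (1 - s2) / norm,
                  b: (1 - s1) * s2 / norm,
                  "Θ": (1 - s1) * (1 - s2) / norm}
    return masses, K

# Two sources agreeing on "cat" yield zero conflict; a "cat" vs. "dog"
# disagreement yields high conflict K = 0.9 * 0.8 = 0.72.
_, k_agree = combine_simple(("cat", 0.9), ("cat", 0.8))
_, k_conflict = combine_simple(("cat", 0.9), ("dog", 0.8))
```

Because each source has only two focal sets, combination is closed-form and linear in the number of sources, which is the computational saving the abstract alludes to.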