The rapid spread of multimodal fake news poses a serious societal threat, as its evolving nature and reliance on timely factual details challenge existing detection methods. Dynamic Retrieval-Augmented Generation provides a promising solution by triggering keyword-based retrieval and incorporating external knowledge, thus enabling both efficient and accurate evidence selection. However, it still faces challenges in addressing issues such as redundant retrieval, coarse similarity, and irrelevant evidence when applied to deceptive content. In this paper, we propose ExDR, an Explanation-driven Dynamic Retrieval-Augmented Generation framework for Multimodal Fake News Detection. Our framework systematically leverages model-generated explanations in both the retrieval triggering and evidence retrieval modules. It assesses triggering confidence from three complementary dimensions, constructs entity-aware indices by fusing deceptive entities, and retrieves contrastive evidence based on deception-specific features to challenge the initial claim and enhance the final prediction. Experiments on two benchmark datasets, AMG and MR2, demonstrate that ExDR consistently outperforms previous methods in retrieval triggering accuracy, retrieval quality, and overall detection performance, highlighting its effectiveness and generalization capability.
翻译:多模态虚假新闻的快速传播构成了严重的社会威胁,其不断演变的特性以及对时效性事实细节的依赖,对现有检测方法提出了挑战。动态检索增强生成通过触发基于关键词的检索并整合外部知识,提供了一种有前景的解决方案,从而实现了高效且准确的证据选择。然而,在应用于欺骗性内容时,该方法仍面临冗余检索、相似度计算粗糙以及证据不相关等问题的挑战。本文提出ExDR,一个面向多模态虚假新闻检测的解释驱动动态检索增强生成框架。我们的框架系统性地利用模型生成的解释,将其应用于检索触发和证据检索两个模块。该框架从三个互补维度评估触发置信度,通过融合欺骗性实体构建实体感知索引,并基于欺骗特异性特征检索对比性证据,以质疑初始声明并增强最终预测。在AMG和MR2两个基准数据集上的实验表明,ExDR在检索触发准确性、检索质量和整体检测性能方面均持续优于先前方法,凸显了其有效性和泛化能力。