Less is More: Unseen Domain Fake News Detection via Causal Propagation Substructures

The spread of fake news on social media poses significant threats to individuals and society. Text-based and graph-based models have been employed for fake news detection by analysing news content and propagation networks, showing promising results in specific scenarios. However, these data-driven models heavily rely on pre-existing in-distribution data for training, limiting their performance when confronted with fake news from emerging or previously unseen domains, known as out-of-distribution (OOD) data. Tackling OOD fake news is a challenging yet critical task. In this paper, we introduce the Causal Subgraph-oriented Domain Adaptive Fake News Detection (CSDA) model, designed to enhance zero-shot fake news detection by extracting causal substructures from propagation graphs using in-distribution data and generalising this approach to OOD data. The model employs a graph neural network based mask generation process to identify dominant nodes and edges within the propagation graph, using these substructures for fake news detection. Additionally, the performance of CSDA is further improved through contrastive learning in few-shot scenarios, where a limited amount of OOD data is available for training. Extensive experiments on public social media datasets demonstrate that CSDA effectively handles OOD fake news detection, achieving a 7 to 16 percents accuracy improvement over other state-of-the-art models.

翻译：社交媒体上虚假新闻的传播对个人和社会构成重大威胁。基于文本和基于图表的模型通过分析新闻内容和传播网络已被用于虚假新闻检测，在特定场景中显示出良好效果。然而，这些数据驱动模型严重依赖预先存在的同分布数据进行训练，当面对来自新兴或先前未见领域（即异分布数据）的虚假新闻时，其性能受到限制。处理异分布虚假新闻是一项具有挑战性但至关重要的任务。本文提出了面向因果子结构的领域自适应虚假新闻检测模型，该模型旨在通过使用同分布数据从传播图中提取因果子结构，并将此方法推广至异分布数据，从而增强零样本虚假新闻检测能力。该模型采用基于图神经网络的掩码生成过程来识别传播图中的主导节点和边，并利用这些子结构进行虚假新闻检测。此外，在少量样本场景中，当仅有有限数量的异分布数据可用于训练时，通过对比学习进一步提升了CSDA的性能。在公开社交媒体数据集上的大量实验表明，CSDA能有效处理异分布虚假新闻检测，相比其他最先进模型实现了7%至16%的准确率提升。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日