Analyzing the Impact of Fake News on the Anticipated Outcome of the 2024 Election Ahead of Time

Despite increasing awareness and research around fake news, there is still a significant need for datasets that specifically target racial slurs and biases within North American political speeches. This is particulary important in the context of upcoming North American elections. This study introduces a comprehensive dataset that illuminates these critical aspects of misinformation. To develop this fake news dataset, we scraped and built a corpus of 40,000 news articles about political discourses in North America. A portion of this dataset (4000) was then carefully annotated, using a blend of advanced language models and human verification methods. We have made both these datasets openly available to the research community and have conducted benchmarking on the annotated data to demonstrate its utility. We release the best-performing language model along with data. We encourage researchers and developers to make use of this dataset and contribute to this ongoing initiative.

翻译：摘要：尽管围绕虚假新闻的认知和研究日益增多，但针对北美政治演讲中种族歧视言论与偏见的数据集仍存在显著缺口。在即将到来的北美大选背景下，这一问题尤为突出。本研究引入了一个揭示虚假信息关键维度的综合性数据集。为构建该虚假新闻数据集，我们爬取并建立了包含4万篇北美政治话语新闻文章的语料库。通过结合先进语言模型与人工验证方法，我们对其中4000篇文章进行了精细标注。我们已向研究社区公开这两个数据集，并对标注数据开展了基准测试以验证其实用性。同时发布性能最优的语言模型及配套数据。我们鼓励研究人员和开发者使用该数据集，共同推进此项持续研究。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日