The proliferation of bias and propaganda on social media is an increasingly significant concern, leading to the development of techniques for automatic detection. This article presents a multilingual corpus of 12, 000 Facebook posts fully annotated for bias and propaganda. The corpus was created as part of the FigNews 2024 Shared Task on News Media Narratives for framing the Israeli War on Gaza. It covers various events during the War from October 7, 2023 to January 31, 2024. The corpus comprises 12, 000 posts in five languages (Arabic, Hebrew, English, French, and Hindi), with 2, 400 posts for each language. The annotation process involved 10 graduate students specializing in Law. The Inter-Annotator Agreement (IAA) was used to evaluate the annotations of the corpus, with an average IAA of 80.8% for bias and 70.15% for propaganda annotations. Our team was ranked among the bestperforming teams in both Bias and Propaganda subtasks. The corpus is open-source and available at https://sina.birzeit.edu/fada
翻译:社交媒体上偏见与宣传的泛滥日益成为一个重要关切,这推动了自动检测技术的发展。本文介绍了一个包含 12,000 条 Facebook 帖子的多语言语料库,这些帖子已针对偏见和宣传进行了完整标注。该语料库是作为 FigNews 2024 关于“以色列-加沙战争”新闻媒体叙事框架的共享任务的一部分创建的。它涵盖了从 2023 年 10 月 7 日至 2024 年 1 月 31 日期间战争中的各类事件。语料库包含五种语言(阿拉伯语、希伯来语、英语、法语和印地语)的 12,000 条帖子,每种语言 2,400 条。标注过程由 10 名法学专业的研究生完成。采用标注者间一致性来评估语料库的标注质量,偏见标注的平均 IAA 为 80.8%,宣传标注的平均 IAA 为 70.15%。我们的团队在偏见和宣传两个子任务中均位列表现最佳的团队之一。该语料库是开源的,可通过 https://sina.birzeit.edu/fada 获取。