On February 24, 2022, Russia invaded Ukraine. In the days that followed, reports kept flooding in from layman to news anchors of a conflict quickly escalating into war. Russia faced immediate backlash and condemnation from the world at large. While the war continues to contribute to an ongoing humanitarian and refugee crisis in Ukraine, a second battlefield has emerged in the online space, both in the use of social media to garner support for both sides of the conflict and also in the context of information warfare. In this paper, we present a collection of over 63 million tweets, from February 22, 2022 through March 8, 2022 that we are publishing for the wider research community to use. This dataset can be found at https://github.com/echen102/ukraine-russia and will be maintained and regularly updated as the war continues to unfold. Our preliminary analysis already shows evidence of public engagement with Russian state sponsored media and other domains that are known to push unreliable information; the former saw a spike in activity on the day of the Russian invasion. Our hope is that this public dataset can help the research community to further understand the ever evolving role that social media plays in information dissemination, influence campaigns, grassroots mobilization, and much more, during a time of conflict.
翻译:2022年2月24日,俄罗斯入侵乌克兰。随后的几天里,从普通民众到新闻主播,关于这场迅速升级为战争的冲突的报道不断涌入。俄罗斯立即遭到国际社会的强烈反对和谴责。虽然战争持续加剧乌克兰境内的人道主义危机和难民危机,但第二个战场已在网络空间浮现——这既体现在社交媒体被用于为冲突双方争取支持,也体现在信息战的语境中。本文呈现了从2022年2月22日至3月8日收集的超过6300万条推文数据集,特此发布供更广泛的研究群体使用。该数据集可通过https://github.com/echen102/ukraine-russia获取,并将随战争进展持续维护和定期更新。我们的初步分析已揭示公众与俄罗斯官方媒体及其他以传播不可靠信息闻名的域名的互动证据:前者在俄罗斯入侵当日出现活动激增。我们期望这个公开数据集能帮助研究群体进一步理解,在冲突时期,社交媒体在信息传播、影响力操作、草根动员等领域不断演化的作用。