In this paper, we discuss the development of an annotation schema to build datasets for evaluating the offline harm potential of social media texts. We define "harm potential" as the potential for an online public post to cause real-world physical harm (i.e., violence). Understanding that real-world violence is often spurred by a web of triggers, often combining several online tactics and pre-existing intersectional fissures in the social milieu, to result in targeted physical violence, we do not focus on any single divisive aspect (i.e., caste, gender, religion, or other identities of the victim and perpetrators) nor do we focus on just hate speech or mis/dis-information. Rather, our understanding of the intersectional causes of such triggers focuses our attempt at measuring the harm potential of online content, irrespective of whether it is hateful or not. In this paper, we discuss the development of a framework/annotation schema that allows annotating the data with different aspects of the text including its socio-political grounding and intent of the speaker (as expressed through mood and modality) that together contribute to it being a trigger for offline harm. We also give a comparative analysis and mapping of our framework with some of the existing frameworks.
翻译:本文讨论了一种用于构建数据集以评估社交媒体文本离线危害潜力的标注方案开发。我们将"危害潜力"定义为在线公共帖子引发现实世界人身伤害(即暴力行为)的可能性。基于现实暴力往往由多重触发因素交织引发——这些因素通常结合多种在线策略与社会环境中既存的交叉性裂隙,最终导致针对性的肢体暴力——我们既不聚焦于单一分裂性维度(如受害者与施害者的种姓、性别、宗教或其他身份),也不局限于仇恨言论或错误/虚假信息。相反,我们对触发因素交叉性成因的理解,促使我们致力于衡量在线内容的危害潜力,无论其是否包含仇恨内容。本文讨论了一个框架/标注方案的开发过程,该方案能够从文本的多重维度进行标注,包括其社会政治基础及说话者意图(通过语气与情态表达),这些维度共同构成触发离线危害的因素。我们还对本文框架与现有部分框架进行了比较分析与映射。