Natural language processing research has begun to embrace the notion of annotator subjectivity, motivated by variations in labelling. This approach treats each annotator's view as valid, which can be highly suitable for tasks that embed subjectivity, e.g., sentiment analysis. However, this construction may be inappropriate for tasks such as hate speech detection, because it affords equal validity to all positions on, e.g., sexism or racism. We argue that the conflation of hate and offence can invalidate findings on hate speech, and call for future work to be situated in theory, disentangling hate from its orthogonal concept, offence.