RAUCG: Retrieval-Augmented Unsupervised Counter Narrative Generation for Hate Speech

The Counter Narrative (CN) is a promising approach to combat online hate speech (HS) without infringing on freedom of speech. In recent years, there has been a growing interest in automatically generating CNs using natural language generation techniques. However, current automatic CN generation methods mainly rely on expert-authored datasets for training, which are time-consuming and labor-intensive to acquire. Furthermore, these methods cannot directly obtain and extend counter-knowledge from external statistics, facts, or examples. To address these limitations, we propose Retrieval-Augmented Unsupervised Counter Narrative Generation (RAUCG) to automatically expand external counter-knowledge and map it into CNs in an unsupervised paradigm. Specifically, we first introduce an SSF retrieval method to retrieve counter-knowledge from the multiple perspectives of stance consistency, semantic overlap rate, and fitness for HS. Then we design an energy-based decoding mechanism by quantizing knowledge injection, countering and fluency constraints into differentiable functions, to enable the model to build mappings from counter-knowledge to CNs without expert-authored CN data. Lastly, we comprehensively evaluate model performance in terms of language quality, toxicity, persuasiveness, relevance, and success rate of countering HS, etc. Experimental results show that RAUCG outperforms strong baselines on all metrics and exhibits stronger generalization capabilities, achieving significant improvements of +2.0% in relevance and +4.5% in success rate of countering metrics. Moreover, RAUCG enabled GPT2 to outperform T0 in all metrics, despite the latter being approximately eight times larger than the former. Warning: This paper may contain offensive or upsetting content!

翻译：反叙事是一种在不侵犯言论自由的前提下对抗在线仇恨言论的有效方法。近年来，利用自然语言生成技术自动生成反叙事的研究日益受到关注。然而，当前自动反叙事生成方法主要依赖专家编写的数据集进行训练，而这类数据集的获取耗时费力。此外，这些方法无法直接从外部统计数据、事实或示例中获取并扩展反驳性知识。为解决上述局限，我们提出检索增强无监督反叙事生成方法，通过无监督范式自动扩展外部反驳性知识并将其映射为反叙事。具体而言，我们首先提出SSF检索方法，从立场一致性、语义重叠率与仇恨言论适应度等多个视角检索反驳性知识；随后设计基于能量的解码机制，通过将知识注入、反驳性与流畅性约束量化为可微函数，使模型能够在无需专家编写的反叙事数据的情况下建立从反驳性知识到反叙事的映射；最后，我们从语言质量、毒性、说服力、相关性及仇恨言论反驳成功率等维度全面评估模型性能。实验结果表明，RAUCG在所有指标上均优于强基线模型，并展现出更强的泛化能力，其中相关性指标提升+2.0%，反驳成功率指标提升+4.5%。此外，RAUCG使GPT2在所有指标上超越参数规模约为其八倍的T0模型。警告：本文可能包含冒犯性或令人不适的内容！

相关内容

中国神经科学学会

关注 0

中国神经科学学会（CNS）是由全国的科研、教学和医院等单位中的神经科学工作者组成的，具有独立法人资格的非营利性社会团体。自2016年起，学会开始致力于神经科学学科引领和学术战略规划。2016-2018年完成了中国科协《神经科学方向预测与技术路线图》项目和《生命科学领域前沿跟踪研究》项目，并且已经由科学出版社正式出版，2020年完成了《神经科学和类脑人工智能发展-新进展新趋势》。2020-2021年还将完成《我国类脑智能产业与技术发展路线图研究》和《科技经济融合发展-智能细胞制造科技创新与产业发展战略研究》。2020年开始学会将每年开展评选年度“中国神经科学重大进展”。中国神经科学学会年会即全国学术会议，是我国神经科学领域规模最大、学术水平最高的学术会议。从2021年开始，改为一年一次，并且与海内外华人神经科学家研讨会结合在一起。学会下属专业分会每年召开形式多样、内容丰富的学术会议和培训班，促进了神经科学领域的学术交流和合作。

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日