Diverse, but Divisive: LLMs Can Exaggerate Gender Differences in Opinion Related to Harms of Misinformation

The pervasive spread of misinformation and disinformation poses a significant threat to society. Professional fact-checkers play a key role in addressing this threat, but the vast scale of the problem forces them to prioritize their limited resources. This prioritization may consider a range of factors, such as varying risks of harm posed to specific groups of people. In this work, we investigate potential implications of using a large language model (LLM) to facilitate such prioritization. Because fact-checking impacts a wide range of diverse segments of society, it is important that diverse views are represented in the claim prioritization process. This paper examines whether a LLM can reflect the views of various groups when assessing the harms of misinformation, focusing on gender as a primary variable. We pose two central questions: (1) To what extent do prompts with explicit gender references reflect gender differences in opinion in the United States on topics of social relevance? and (2) To what extent do gender-neutral prompts align with gendered viewpoints on those topics? To analyze these questions, we present the TopicMisinfo dataset, containing 160 fact-checked claims from diverse topics, supplemented by nearly 1600 human annotations with subjective perceptions and annotator demographics. Analyzing responses to gender-specific and neutral prompts, we find that GPT 3.5-Turbo reflects empirically observed gender differences in opinion but amplifies the extent of these differences. These findings illuminate AI's complex role in moderating online communication, with implications for fact-checkers, algorithm designers, and the use of crowd-workers as annotators. We also release the TopicMisinfo dataset to support continuing research in the community.

翻译：虚假信息和错误信息的普遍传播对社会构成重大威胁。专业的事实核查人员在应对这一威胁中扮演关键角色，但问题的巨大规模迫使他们将有限的资源优先分配。这种优先分配可能考虑一系列因素，例如针对特定人群的不同危害风险。在本研究中，我们探讨使用大语言模型（LLM）促进此类优先分配的潜在影响。由于事实核查影响社会中广泛多样的群体，因此在主张优先排序过程中反映多元观点至关重要。本文考察LLM在评估虚假信息危害时能否反映不同群体的观点，重点关注性别作为主要变量。我们提出两个核心问题：（1）带有明确性别指代的提示在多大程度上反映了美国社会中与社交议题相关的性别观点差异？（2）性别中立的提示在多大程度上与这些议题上的性别视角一致？为分析这些问题，我们提出了TopicMisinfo数据集，包含来自不同主题的160个经过事实核查的主张，附有近1600条包含主观认知和注释者人口统计信息的人工标注。通过分析针对特定性别和中性提示的响应，我们发现GPT 3.5-Turbo能反映经验观察到的性别观点差异，但夸大了这些差异的程度。这些发现揭示了人工智能在调节在线交流中的复杂作用，对事实核查人员、算法设计人员以及使用众包工作者作为注释者具有启示意义。我们还发布了TopicMisinfo数据集，以支持社区内的持续研究。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日