The Effect of Group Status on the Variability of Group Representations in LLM-generated Text

Large Language Models (LLMs) have become pervasive in everyday life, yet their inner workings remain opaque. While scholarly efforts have demonstrated LLMs' propensity to reproduce biases in their training data, they have primarily focused on the association of social groups with stereotypic attributes. In this paper, we extend this line of inquiry to investigate a bias akin to the social-psychological phenomenon where socially dominant groups are perceived to be less homogeneous than socially subordinate groups as it is reproduced by LLMs. We had ChatGPT, a state-of-the-art LLM, generate a diversity of texts about intersectional group identities and compared text homogeneity. We consistently find that LLMs portray African, Asian, and Hispanic Americans as more homogeneous than White Americans. They also portray women as more homogeneous than men, but these differences are small. Finally, we find that the effect of gender differs across racial/ethnic groups such that the effect of gender is consistent within African and Hispanic Americans but not within Asian and White Americans. We speculate possible sources of this bias in LLMs and posit that the bias has the potential to amplify biases in future LLM training and to reinforce stereotypes.

翻译：大型语言模型（LLMs）已深入日常生活，但其内部机制仍不透明。尽管学术界已证实LLMs会复现训练数据中的偏见，但现有研究主要聚焦于社会群体与刻板属性之间的关联。本文将研究拓展至社会心理学现象：LLMs可能复现一种偏见——即社会优势群体相较于从属群体被感知为更具异质性。我们让前沿大语言模型ChatGPT生成关于交叉群体身份的多样化文本，并比较文本同质性。结果一致发现：LLMs将非裔、亚裔及西班牙裔美国人描述得比白人美国人更具同质性；将女性描述得比男性更具同质性（但差异较小）。此外，性别效应在不同种族/族群中存在差异：非裔和西班牙裔美国人群体中性别效应具有一致性，但在亚裔和白人群体中则不显著。我们推测了LLMs产生这种偏见的潜在来源，并指出该偏见可能放大未来LLM训练中的偏差，同时强化刻板印象。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日