Large language models (LLMs) are becoming pervasive in everyday life, yet their propensity to reproduce biases inherited from training data remains a pressing concern. Prior investigations into bias in LLMs have focused on the association of social groups with stereotypical attributes. However, this is only one form of human bias such systems may reproduce. We investigate a new form of bias in LLMs that resembles a social psychological phenomenon in which socially subordinate groups are perceived as more homogeneous than socially dominant groups. We had ChatGPT, a state-of-the-art LLM, generate texts about intersectional group identities and compared those texts on measures of homogeneity. We consistently found that ChatGPT portrayed African, Asian, and Hispanic Americans as more homogeneous than White Americans, indicating that the model described racial minority groups with a narrower range of human experience. ChatGPT also portrayed women as more homogeneous than men, but these differences were small. Finally, the effect of gender differed across racial/ethnic groups: it was consistent within African and Hispanic Americans but not within Asian and White Americans. We argue that the tendency of LLMs to describe groups as less diverse than they are risks perpetuating stereotypes and discriminatory behavior.