Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities

Large language models (LLMs) are increasingly used for text generation tasks from everyday use to high-stakes enterprise and government applications, including simulated interviews with asylum seekers. While many works highlight the new potential applications of LLMs, there are risks of LLMs encoding and perpetuating harmful biases about non-dominant communities across the globe. To better evaluate and mitigate such harms, more research examining how LLMs portray diverse individuals is needed. In this work, we study how national origin identities are portrayed by widely-adopted LLMs in response to open-ended narrative generation prompts. Our findings demonstrate the presence of persistent representational harms by national origin, including harmful stereotypes, erasure, and one-dimensional portrayals of Global Majority identities. Minoritized national identities are simultaneously underrepresented in power-neutral stories and overrepresented in subordinated character portrayals, which are over fifty times more likely to appear than dominant portrayals. The degree of harm is amplified when US nationality cues (e.g., ``American'') are present in input prompts. Notably, we find that the harms we identify cannot be explained away via sycophancy, as US-centric biases persist even when replacing US nationality cues with non-US national identities in the prompts. Based on our findings, we call for further exploration of cultural harms in LLMs through methodologies that center Global Majority perspectives and challenge the uncritical adoption of US-based LLMs for the classification, surveillance, and misrepresentation of the majority of our planet.

翻译：大型语言模型（LLMs）日益被用于从日常使用到高风险企业和政府应用的文本生成任务，包括对寻求庇护者的模拟面谈。尽管许多研究强调了LLMs的新潜在应用，但存在LLMs编码并延续针对全球非主导群体有害偏见的风险。为更好地评估和缓解此类伤害，需要更多研究探讨LLMs如何描绘不同个体。在本工作中，我们研究了广泛采用的LLMs在回应开放式叙事生成提示时，如何描绘国家起源身份。我们的发现表明，针对国家起源存在持续的表征性伤害，包括有害刻板印象、抹除以及对全球多数民族身份的单一维度描绘。少数族裔国家身份在权力中性叙事中的代表性不足，同时在从属角色描绘中的代表性过度——后者出现的可能性是主导性描绘的五十倍以上。当输入提示中出现美国国籍线索（如"美国人"）时，伤害程度进一步加剧。值得注意的是，我们发现所识别的伤害无法通过谄媚效应解释，因为即便将提示中的美国国籍线索替换为非美国国家身份，以美国为中心的偏见依然存在。基于我们的发现，我们呼吁通过聚焦全球多数民族视角的方法论，进一步探索LLMs中的文化伤害，并质疑不加批判地将基于美国的LLMs用于对我们星球上多数人群进行分类、监视及错误表征的做法。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

大型语言模型（LLM）智能体全栈安全的综述：数据、训练与部署

专知会员服务

33+阅读 · 2025年4月23日

揭示生成式人工智能 / 大型语言模型（LLMs）的军事潜力

专知会员服务

32+阅读 · 2024年9月26日

《LLM 时代小模型的作用》综述

专知会员服务

49+阅读 · 2024年9月12日

基于大语言模型（LLM）的合成数据生成、策展和评估的综述

专知会员服务

62+阅读 · 2024年7月5日