ViSAGe: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation

Recent studies have shown that Text-to-Image (T2I) model generations can reflect social stereotypes present in the real world. However, existing approaches for evaluating stereotypes have a noticeable lack of coverage of global identity groups and their associated stereotypes. To address this gap, we introduce the ViSAGe (Visual Stereotypes Around the Globe) dataset to enable the evaluation of known nationality-based stereotypes in T2I models, across 135 nationalities. We enrich an existing textual stereotype resource by distinguishing between stereotypical associations that are more likely to have visual depictions, such as `sombrero', from those that are less visually concrete, such as 'attractive'. We demonstrate ViSAGe's utility through a multi-faceted evaluation of T2I generations. First, we show that stereotypical attributes in ViSAGe are thrice as likely to be present in generated images of corresponding identities as compared to other attributes, and that the offensiveness of these depictions is especially higher for identities from Africa, South America, and South East Asia. Second, we assess the stereotypical pull of visual depictions of identity groups, which reveals how the 'default' representations of all identity groups in ViSAGe have a pull towards stereotypical depictions, and that this pull is even more prominent for identity groups from the Global South. CONTENT WARNING: Some examples contain offensive stereotypes.

翻译：近期研究表明，文本到图像（T2I）模型生成的内容能够反映现实世界中存在的社会刻板印象。然而，现有的刻板印象评估方法明显缺乏对全球身份群体及其相关刻板印象的覆盖。为弥补这一空白，我们引入了ViSAGe（全球视觉刻板印象）数据集，以评估T2I模型中基于135个国籍的已知刻板印象。我们通过区分更可能具有视觉描绘的刻板印象关联（如“宽边帽”）与视觉上较不具体的关联（如“有吸引力的”），对现有文本刻板印象资源进行了丰富。我们通过多维度评估T2I生成结果来证明ViSAGe的实用性。首先，我们发现ViSAGe中的刻板印象属性在对应身份生成图像中出现的可能性是其他属性的三倍，且这些描绘的冒犯性对来自非洲、南美洲和东南亚的身份群体尤为突出。其次，我们评估了身份群体视觉描绘的刻板印象吸引力，揭示了ViSAGe中所有身份群体的“默认”表征均存在向刻板描绘靠拢的趋势，且这种趋势对全球南方身份群体更为显著。内容警示：部分示例包含冒犯性刻板印象。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日