ViSAGe: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation

Recent studies have shown that Text-to-Image (T2I) model generations can reflect social stereotypes present in the real world. However, existing approaches for evaluating stereotypes have a noticeable lack of coverage of global identity groups and their associated stereotypes. To address this gap, we introduce the ViSAGe (Visual Stereotypes Around the Globe) dataset to enable the evaluation of known nationality-based stereotypes in T2I models, across 135 nationalities. We enrich an existing textual stereotype resource by distinguishing between stereotypical associations that are more likely to have visual depictions, such as `sombrero', from those that are less visually concrete, such as 'attractive'. We demonstrate ViSAGe's utility through a multi-faceted evaluation of T2I generations. First, we show that stereotypical attributes in ViSAGe are thrice as likely to be present in generated images of corresponding identities as compared to other attributes, and that the offensiveness of these depictions is especially higher for identities from Africa, South America, and South East Asia. Second, we assess the stereotypical pull of visual depictions of identity groups, which reveals how the 'default' representations of all identity groups in ViSAGe have a pull towards stereotypical depictions, and that this pull is even more prominent for identity groups from the Global South. CONTENT WARNING: Some examples contain offensive stereotypes.

翻译：近期研究表明，文本到图像（T2I）模型生成内容可能反映现实世界中的社会刻板印象。然而，现有刻板印象评估方法在全球身份群体及其关联刻板印象的覆盖范围上存在显著不足。为填补这一空白，我们提出了ViSAGe（全球视觉刻板印象）数据集，可针对135个国籍评估T2I模型中基于国籍的已知刻板印象。通过区分更可能具有视觉描绘（如"宽边帽"）与视觉具象度较低（如"有魅力"）的刻板关联，我们对现有文本刻板印象资源进行了增强。通过T2I生成内容的多维度评估，我们证明了ViSAGe的实用性。首先，与其它属性相比，ViSAGe中的刻板属性在对应身份群体的生成图像中出现频率高出三倍，且这些描绘的攻击性对非洲、南美及东南亚身份群体尤为突出。其次，我们评估了身份群体视觉描绘的刻板倾向性，这揭示了ViSAGe中所有身份群体的"默认"表征均存在向刻板描绘的倾向，且这种倾向对全球南方身份群体更为显著。内容警告：部分示例包含冒犯性刻板印象。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日