March 20, 2023

Stable Bias: Analyzing Societal Representations in Diffusion Models


Alexandra Sasha Luccioni, Christopher Akiki, Margaret Mitchell, Yacine Jernite

As machine learning-enabled Text-to-Image (TTI) systems are becoming increasingly prevalent and seeing growing adoption as commercial services, characterizing the social biases they exhibit is a necessary first step to lowering their risk of discriminatory outcomes. This evaluation, however, is made more difficult by the synthetic nature of these systems' outputs; since artificial depictions of fictive humans have no inherent gender or ethnicity nor do they belong to socially-constructed groups, we need to look beyond common categorizations of diversity or representation. To address this need, we propose a new method for exploring and quantifying social biases in TTI systems by directly comparing collections of generated images designed to showcase a system's variation across social attributes (gender and ethnicity) and target attributes for bias evaluation (professions and gender-coded adjectives). Our approach allows us to (i) identify specific bias trends through visualization tools, (ii) provide targeted scores to directly compare models in terms of diversity and representation, and (iii) jointly model interdependent social variables to support a multidimensional analysis. We use this approach to analyze over 96,000 images generated by 3 popular TTI systems (DALL-E 2, Stable Diffusion v1.4 and v2) and find that all three significantly over-represent the portion of their latent space associated with whiteness and masculinity across target attributes; among the systems studied, DALL-E 2 shows the least diversity, followed by Stable Diffusion v2 then v1.4.
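To make the idea of a "targeted score" for diversity more concrete, here is a minimal sketch of one possible choice: the normalized Shannon entropy of group assignments over a collection of generated images. This is an illustrative assumption, not the scoring method described by the authors; the abstract does not specify how the scores are computed. The function name `normalized_entropy`, the integer group labels, and the two sample collections are all hypothetical.

```python
import math
from collections import Counter


def normalized_entropy(labels, num_groups):
    """Return the Shannon entropy of the label distribution, scaled to [0, 1].

    labels: one group id per generated image (hypothetical, e.g. a cluster id).
    num_groups: number of groups an image could have been assigned to.
    A value of 1.0 means images are spread evenly over all groups;
    a value near 0.0 means almost all images fall into a single group.
    """
    if num_groups < 2:
        raise ValueError("num_groups must be at least 2")
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("labels must not be empty")
    # Shannon entropy in nats over the observed group frequencies.
    entropy = -sum((c / total) * math.log(c / total) for c in counts.values())
    # Divide by the maximum possible entropy for num_groups groups.
    return entropy / math.log(num_groups)


# Hypothetical group assignments for 100 images generated from one prompt.
balanced = [0, 1, 2, 3] * 25          # evenly spread over four groups
skewed = [0] * 97 + [1, 2, 3]         # dominated by a single group

print(normalized_entropy(balanced, 4))
print(normalized_entropy(skewed, 4))
```

Under these assumptions, a lower score for one system than another on the same prompts would indicate that its outputs concentrate in fewer groups, which is the kind of direct model-to-model comparison the abstract refers to.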

