Machine learning practitioners frequently observe tension between predictive accuracy and group fairness constraints -- yet sometimes fairness interventions appear to improve accuracy. We show that both phenomena can be artifacts of training data that misrepresents subgroup proportions. Under subpopulation shift (stable within-group distributions, shifted group proportions), we establish: (i) full importance-weighted correction is asymptotically unbiased but suboptimal in finite samples; (ii) the optimal finite-sample correction is a shrinkage reweighting that interpolates between the target and training mixtures; (iii) apparent "fairness helps accuracy" can arise from comparing fairness methods to an improperly weighted baseline. We provide an actionable evaluation protocol: fix representation (optimally) before fairness -- compare fairness interventions against a shrinkage-corrected baseline to isolate the true, irreducible price of fairness. Experiments on synthetic and real-world benchmarks (Adult, COMPAS) validate our theoretical predictions and demonstrate that this protocol eliminates spurious tradeoffs, revealing the genuine fairness-utility frontier.
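To make the shrinkage reweighting concrete, here is a minimal sketch (not the paper's implementation): each example's weight interpolates between 1 (keep the training mixture) and the full importance ratio of target to training group proportions. The function name, the interpolation parameter `lam`, and the toy proportions are illustrative assumptions; the paper's optimal shrinkage level would depend on sample size.

```python
import numpy as np

def shrinkage_weights(groups, pi_target, pi_train, lam):
    """Per-example weights interpolating between no correction (lam = 0,
    keep the training mixture) and full importance weighting toward the
    target mixture (lam = 1). Illustrative sketch only: the optimal lam
    in the paper depends on the finite-sample regime."""
    full = {g: pi_target[g] / pi_train[g] for g in pi_train}
    return np.array([(1 - lam) + lam * full[g] for g in groups])

# Toy example: group "a" is underrepresented in training (20% vs. a 50% target).
groups = ["a", "a", "b", "b", "b"]
w = shrinkage_weights(groups,
                      pi_target={"a": 0.5, "b": 0.5},
                      pi_train={"a": 0.2, "b": 0.8},
                      lam=0.5)
# "a" examples are upweighted (toward 0.5/0.2 = 2.5), "b" downweighted
# (toward 0.5/0.8 = 0.625), each halfway because lam = 0.5.
```

With `lam = 1` this recovers full importance weighting (asymptotically unbiased but finite-sample suboptimal, per (i)); with `lam = 0` it leaves the miscalibrated training mixture untouched.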