Machine learning (ML) may be oblivious to human bias, but it is not immune to perpetuating it. Marginalisation and inequitable group representation are often traceable in the very data used for training, and may be reflected or even amplified by the learning models. In the present work, we aim to clarify the role played by data geometry in the emergence of ML bias. We introduce an exactly solvable high-dimensional model of data imbalance, where parametric control over the many bias-inducing factors allows for an extensive exploration of the bias-inheritance mechanism. Through the tools of statistical physics, we analytically characterise the typical properties of learning models trained in this synthetic framework and obtain exact predictions for the observables commonly employed in fairness assessment. Despite the simplicity of the data model, we retrace and unpack typical unfairness behaviour observed on real-world datasets. We also obtain a detailed analytical characterisation of a class of bias-mitigation strategies. We first consider a basic loss-reweighing scheme, which allows for an implicit minimisation of different unfairness metrics, and we quantify the incompatibilities between several existing fairness criteria. We then consider a novel mitigation strategy based on a matched-inference approach, which introduces coupled learning models. Our theoretical analysis shows that this coupled strategy can strike superior fairness-accuracy trade-offs.
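To make the setting concrete, below is a minimal, self-contained sketch, not the paper's actual model, of the kind of pipeline the abstract alludes to: a synthetic two-group classification task with controllable group imbalance, a logistic-regression learner trained with a per-group loss-reweighing factor, and two standard fairness observables (per-group accuracy and the demographic-parity gap). All names and parameter values (rho, shift, w_minus, ...) are illustrative assumptions.

```python
# Minimal sketch (not the paper's exact model): a two-group Gaussian data
# model with group imbalance, logistic regression trained by gradient
# descent with a per-group loss-reweighing factor, and simple fairness
# observables (per-group accuracy, demographic-parity gap).
import numpy as np

rng = np.random.default_rng(0)

def make_data(n, d, rho=0.8, shift=1.0):
    """Two sub-populations: group +1 (fraction rho) and group -1.
    Labels come from a common linear 'teacher'; group -1 inputs are
    shifted, mimicking geometric data imbalance (illustrative choice)."""
    g = np.where(rng.random(n) < rho, 1, -1)          # group membership
    teacher = rng.normal(size=d) / np.sqrt(d)
    x = rng.normal(size=(n, d))
    x[g == -1] += shift / np.sqrt(d)                  # group-dependent shift
    y = np.sign(x @ teacher)
    return x, y, g

def train_logreg(x, y, g, w_minus=1.0, lr=0.5, steps=500):
    """Gradient descent on a reweighed logistic loss: the loss of every
    sample from the under-represented group is multiplied by w_minus."""
    n, d = x.shape
    w = np.zeros(d)
    sample_w = np.where(g == -1, w_minus, 1.0)
    for _ in range(steps):
        margin = y * (x @ w)
        grad = -(sample_w * y / (1 + np.exp(margin))) @ x / n
        w -= lr * grad
    return w

def fairness_report(x, y, g, w):
    pred = np.sign(x @ w)
    for grp in (+1, -1):
        acc = np.mean(pred[g == grp] == y[g == grp])
        print(f"group {grp:+d}: accuracy = {acc:.3f}")
    # demographic-parity gap: difference in positive-prediction rates
    dp_gap = abs(np.mean(pred[g == 1] == 1) - np.mean(pred[g == -1] == 1))
    print(f"demographic-parity gap = {dp_gap:.3f}")

x, y, g = make_data(n=5000, d=50)
print("-- plain --")
fairness_report(x, y, g, train_logreg(x, y, g))
print("-- reweighed --")
fairness_report(x, y, g, train_logreg(x, y, g, w_minus=4.0))
```

Sweeping w_minus trades average accuracy against the per-group gaps, which is the kind of fairness-accuracy trade-off the analysis characterises exactly.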
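The coupled-models strategy can likewise be sketched. The abstract specifies only that coupled learning models are introduced under a matched-inference approach; the version below assumes, purely for illustration, one logistic learner per group with an elastic L2 coupling of strength gamma between the two weight vectors, with inference performed by the model matched to each sample's group. This may differ from the paper's exact formulation.

```python
# Illustrative sketch of a coupled-models mitigation: two per-group
# logistic learners tied by an elastic L2 coupling (an assumption of
# this sketch, not necessarily the paper's coupling).
import numpy as np

rng = np.random.default_rng(0)
n, d, rho, shift = 5000, 50, 0.8, 1.0
g = np.where(rng.random(n) < rho, 1, -1)              # group membership
teacher = rng.normal(size=d) / np.sqrt(d)
x = rng.normal(size=(n, d))
x[g == -1] += shift / np.sqrt(d)                      # group-dependent shift
y = np.sign(x @ teacher)

def coupled_train(x, y, g, gamma=1.0, lr=0.5, steps=500):
    """Each group gets its own weight vector; the penalty
    gamma/2 * ||w_plus - w_minus||^2 ties the two learners together."""
    d = x.shape[1]
    w = {+1: np.zeros(d), -1: np.zeros(d)}
    for _ in range(steps):
        grads = {}
        for grp in (+1, -1):
            xg, yg = x[g == grp], y[g == grp]
            margin = yg * (xg @ w[grp])
            grads[grp] = -(yg / (1 + np.exp(margin))) @ xg / len(yg)
        for grp in (+1, -1):
            w[grp] -= lr * (grads[grp] + gamma * (w[grp] - w[-grp]))
    return w

w = coupled_train(x, y, g, gamma=1.0)
for grp in (+1, -1):
    pred = np.sign(x[g == grp] @ w[grp])   # matched inference: own model
    print(f"group {grp:+d}: accuracy = {np.mean(pred == y[g == grp]):.3f}")
```

In this sketch gamma interpolates between fully independent per-group models (gamma → 0) and a single shared model (gamma → ∞), turning the fairness-accuracy trade-off into an explicit one-parameter family.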