There has been an increasing interest in enhancing the fairness of machine learning (ML). Despite the growing number of fairness-improving methods, we lack a systematic understanding of the trade-offs among factors considered in the ML pipeline when fairness-improving methods are applied. This understanding is essential for developers to make informed decisions regarding the provision of fair ML services. Nonetheless, it is extremely difficult to analyze the trade-offs when there are multiple fairness parameters and other crucial metrics involved, coupled, and even in conflict with one another. This paper uses causality analysis as a principled method for analyzing trade-offs between fairness parameters and other crucial metrics in ML pipelines. To ractically and effectively conduct causality analysis, we propose a set of domain-specific optimizations to facilitate accurate causal discovery and a unified, novel interface for trade-off analysis based on well-established causal inference methods. We conduct a comprehensive empirical study using three real-world datasets on a collection of widelyused fairness-improving techniques. Our study obtains actionable suggestions for users and developers of fair ML. We further demonstrate the versatile usage of our approach in selecting the optimal fairness-improving method, paving the way for more ethical and socially responsible AI technologies.
翻译:近年来,提升机器学习公平性的研究日益受到关注。尽管公平性改进方法层出不穷,但在应用这些方法时,我们缺乏对机器学习流程中各类因素间权衡关系的系统性理解。这种理解对于开发者就提供公平机器学习服务做出明智决策至关重要。然而,当涉及多个公平性参数及其他关键指标相互耦合甚至冲突时,分析这些权衡关系变得极为困难。本文采用因果分析作为原理性方法,用于分析机器学习流程中公平性参数与其他关键指标间的权衡关系。为实用且高效地实施因果分析,我们提出一组领域特定优化策略以促进精确因果发现,并基于成熟因果推断方法构建统一的新型权衡分析接口。我们利用三个真实数据集,对一系列广泛使用的公平性改进技术进行了全面实证研究。研究结果为公平机器学习的用户和开发者提供了可操作建议。我们进一步展示了该方法在优选公平性改进方案中的多样化应用,为发展更具伦理意识和社会责任感的人工智能技术奠定基础。