Collaborative filtering based recommendation learns users' preferences from all users' historical behavior data, and has been popular to facilitate decision making. R Recently, the fairness issue of recommendation has become more and more essential. A recommender system is considered unfair when it does not perform equally well for different user groups according to users' sensitive attributes~(e.g., gender, race). Plenty of methods have been proposed to alleviate unfairness by optimizing a predefined fairness goal or changing the distribution of unbalanced training data. However, they either suffered from the specific fairness optimization metrics or relied on redesigning the current recommendation architecture. In this paper, we study how to improve recommendation fairness from the data augmentation perspective. The recommendation model amplifies the inherent unfairness of imbalanced training data. We augment imbalanced training data towards balanced data distribution to improve fairness. The proposed framework is generally applicable to any embedding-based recommendation, and does not need to pre-define a fairness metric. Extensive experiments on two real-world datasets clearly demonstrate the superiority of our proposed framework. We publish the source code at https://github.com/newlei/FDA.
翻译:基于协同过滤的推荐通过学习所有用户的历史行为数据来理解用户偏好,已成为辅助决策的常用方法。近年来,推荐系统的公平性问题日益重要。当推荐系统根据用户的敏感属性(如性别、种族)对不同用户群体表现不一致时,即被认为存在不公平性。已有大量方法通过优化预定义的公平目标或调整不平衡训练数据的分布来缓解不公平问题,但这些方法要么受限于特定的公平性优化指标,要么需要重新设计现有推荐架构。本文从数据增强的角度研究如何提升推荐公平性。推荐模型会放大不平衡训练数据中固有的不公平性,我们通过对不平衡训练数据进行增强以平衡数据分布,从而提升公平性。所提出的框架可广泛适用于任何基于嵌入的推荐方法,且无需预定义公平性指标。在两个真实数据集上的大量实验清晰证明了我们框架的优越性。我们在 https://github.com/newlei/FDA 发布源代码。