Unfair behaviors of Machine Learning (ML) software have garnered increasing attention and concern among software engineers. To tackle this issue, extensive research has been dedicated to conducting fairness testing of ML software, and this paper offers a comprehensive survey of existing studies in this field. We collect 100 papers and organize them based on the testing workflow (i.e., how to test) and testing components (i.e., what to test). Furthermore, we analyze the research focus, trends, and promising directions in the realm of fairness testing. We also identify widely-adopted datasets and open-source tools for fairness testing.
翻译:机器学习(ML)软件的不公平行为日益引起软件工程师的关注与担忧。为解决这一问题,大量研究致力于ML软件的公平性测试,本文对现有相关研究进行了全面综述。我们收集了100篇论文,并根据测试工作流(即如何测试)与测试组件(即测试什么)对它们进行组织整理。此外,我们分析了公平性测试领域的研究重点、趋势及有前景的方向,并梳理了该领域广泛采用的数据集与开源工具。