FairLay-ML: Intuitive Remedies for Unfairness in Data-Driven Social-Critical Algorithms

This thesis explores open-sourced machine learning (ML) model explanation tools to understand whether these tools can allow a layman to visualize, understand, and suggest intuitive remedies to unfairness in ML-based decision-support systems. Machine learning models trained on datasets biased against minority groups are increasingly used to guide life-altering social decisions, prompting the urgent need to study their logic for unfairness. Due to this problem's impact on vast populations of the general public, it is critical for the layperson -- not just subject matter experts in social justice or machine learning experts -- to understand the nature of unfairness within these algorithms and the potential trade-offs. Existing research on fairness in machine learning focuses mostly on the mathematical definitions and tools to understand and remedy unfair models, with some directly citing user-interactive tools as necessary for future work. This thesis presents FairLay-ML, a proof-of-concept GUI integrating some of the most promising tools to provide intuitive explanations for unfair logic in ML models by integrating existing research tools (e.g. Local Interpretable Model-Agnostic Explanations) with existing ML-focused GUI (e.g. Python Streamlit). We test FairLay-ML using models of various accuracy and fairness generated by an unfairness detector tool, Parfait-ML, and validate our results using Themis. Our study finds that the technology stack used for FairLay-ML makes it easy to install and provides real-time black-box explanations of pre-trained models to users. Furthermore, the explanations provided translate to actionable remedies.

翻译：本论文探讨开源机器学习模型解释工具，旨在理解这些工具是否能让非专业人士可视化、理解并建议基于机器学习的决策支持系统中不公平问题的直观补救措施。在针对少数群体存在偏倚的数据集上训练的机器学习模型，正日益被用于指导改变人生的社会决策，这迫切需要我们研究其逻辑中是否存在不公平性。鉴于该问题对广大公众群体的影响，不仅社会正义领域专家或机器学习专家，普通人也必须理解这些算法中不公平性的本质及其潜在权衡。现有机器学习公平性研究主要集中于数学定义及理解与补救不公平模型的工具，其中部分研究直接指出用户交互式工具是未来工作的必要方向。本文提出FairLay-ML，一个概念验证型图形用户界面，通过整合现有研究工具（例如局部可解释模型无关解释）与现有机器学习图形用户界面（例如Python Streamlit），为机器学习模型中的不公平逻辑提供直观解释。我们使用由不公平性检测工具Parfait-ML生成的不同准确率和公平性水平的模型测试FairLay-ML，并通过Themis验证结果。研究发现，FairLay-ML所采用的技术栈易于安装，并能向用户提供预训练模型的实时黑箱解释。此外，所提供的解释可转化为可操作的补救措施。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日