An Information-Flow Perspective on Algorithmic Fairness

This work presents insights gained by investigating the relationship between algorithmic fairness and the concept of secure information flow. The problem of enforcing secure information flow is well-studied in the context of information security: If secret information may "flow" through an algorithm or program in such a way that it can influence the program's output, then that is considered insecure information flow as attackers could potentially observe (parts of) the secret. There is a strong correspondence between secure information flow and algorithmic fairness: if protected attributes such as race, gender, or age are treated as secret program inputs, then secure information flow means that these ``secret'' attributes cannot influence the result of a program. While most research in algorithmic fairness evaluation concentrates on studying the impact of algorithms (often treating the algorithm as a black-box), the concepts derived from information flow can be used both for the analysis of disparate treatment as well as disparate impact w.r.t. a structural causal model. In this paper, we examine the relationship between quantitative as well as qualitative information-flow properties and fairness. Moreover, based on this duality, we derive a new quantitative notion of fairness called fairness spread, which can be easily analyzed using quantitative information flow and which strongly relates to counterfactual fairness. We demonstrate that off-the-shelf tools for information-flow properties can be used in order to formally analyze a program's algorithmic fairness properties, including the new notion of fairness spread as well as established notions such as demographic parity.

翻译：本文通过研究算法公平性与安全信息流概念之间的关系，提出了若干见解。在信息安全领域，强制实施安全信息流的问题已得到充分研究：若机密信息可能通过算法或程序“流动”，从而影响程序输出，则此类情况被视为不安全信息流，因为攻击者有可能观察（部分）机密信息。安全信息流与算法公平性之间存在强对应关系：若将种族、性别、年龄等受保护属性视为程序中的机密输入，则安全信息流意味着这些“机密”属性不能影响程序结果。尽管大多数算法公平性评估研究侧重于分析算法的影响（通常将算法视为黑箱），但从信息流推导出的概念既可用于分析差别对待，也可用于分析结构性因果模型中的差别影响。本文探讨了定量及定性信息流属性与公平性之间的关系。此外，基于这种对偶性，我们提出了一种新的定量公平性概念——公平性扩散度，该概念可通过定量信息流轻松分析，并与反事实公平性密切相关。我们证明，现成的信息流属性工具可用于形式化分析程序的算法公平性属性，包括新提出的公平性扩散度概念以及已确立的概念（如人口统计均等性）。