Counterfactual Explanations (CE) is the de facto method for providing insight and interpretability in black-box decision-making models by identifying alternative input instances that lead to different outcomes. This paper extends the concept of CEs to a distributional context, broadening the scope from individual data points to entire input and output distributions, named Distributional Counterfactual Explanation (DCE). In DCE, our focus shifts to analyzing the distributional properties of the factual and counterfactual, drawing parallels to the classical approach of assessing individual instances and their resulting decisions. We leverage Optimal Transport (OT) to frame a chance-constrained optimization problem, aiming to derive a counterfactual distribution that closely aligns with its factual counterpart, substantiated by statistical confidence. Our proposed optimization method, DISCOUNT, strategically balances this confidence across both input and output distributions. This algorithm is accompanied by an analysis of its convergence rate. The efficacy of our proposed method is substantiated through a series of illustrative case studies, highlighting its potential in providing deep insights into decision-making models.
翻译:反事实解释(CE)是黑箱决策模型中提供洞察与可解释性的标准方法,通过识别导致不同结果的替代输入实例实现。本文将反事实解释的概念扩展至分布语境,将范围从个体数据点拓展至整个输入与输出分布,命名为分布反事实解释(DCE)。在DCE中,我们聚焦于分析事实分布与反事实分布的分布特性,与评估个体实例及其决策结果的传统方法形成类比。我们利用最优输运(Optimal Transport, OT)构建机会约束优化问题,旨在推导出与事实分布高度一致的反事实分布,并辅以统计置信度支撑。提出的优化方法DISCOUNT,能在输入与输出分布间策略性地平衡该置信度。本文同时对该算法的收敛速率进行了分析。通过一系列典型案例研究,验证了所提方法的有效性,凸显其在提供深度决策模型洞察方面的潜力。