The Right to Explanation and the Right to be Forgotten are two important principles for regulating algorithmic decision making and data usage in real-world applications. While the right to explanation allows individuals to request an actionable explanation for an algorithmic decision, the right to be forgotten allows them to ask for their data to be deleted from all the databases and models of an organization. Intuitively, enforcing the right to be forgotten may trigger model updates, which in turn invalidate previously provided explanations, thus violating the right to explanation. In this work, we investigate the technical implications of the interference between these two regulatory principles, and propose the first algorithmic framework to resolve the tension between them. To this end, we formulate a novel optimization problem to generate explanations that are robust to model updates caused by the removal of training data instances in response to data deletion requests. We then derive an efficient approximation algorithm to handle the combinatorial complexity of this optimization problem. We theoretically demonstrate that our method generates explanations that are provably robust to worst-case data deletion requests with bounded costs in the case of linear models and certain classes of non-linear models. Extensive experimentation with real-world datasets demonstrates the efficacy of the proposed framework.
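The tension described in the abstract can be illustrated with a toy sketch. This is not the paper's optimization problem or its approximation algorithm: it assumes a one-dimensional least-squares classifier, a made-up dataset, and a brute-force enumeration of every possible deletion of up to k training points, which is exactly the combinatorial blow-up an efficient method would need to avoid. All names (`DATA`, `fit_line`, `naive_recourse`, `robust_recourse`) are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Hypothetical toy data: (feature value, label), label 1 = favourable outcome.
DATA = [(1, 0), (2, 0), (3, 0), (4, 1), (5, 1), (6, 1)]
THRESHOLD = 0.5   # approve when w * x + b >= THRESHOLD
EPS = 1e-6        # small margin so a recourse sits just past the boundary


def fit_line(points):
    """Ordinary least squares for y ~ w * x + b on (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    w = sxy / sxx
    return w, mean_y - w * mean_x


def is_approved(model, x):
    w, b = model
    return w * x + b >= THRESHOLD


def boundary(model):
    """Smallest x the model approves (assumes a positive slope w)."""
    w, b = model
    return (THRESHOLD - b) / w


def models_after_deletion(points, k):
    """Retrain on every dataset obtained by deleting 1..k points (brute force)."""
    models = []
    for r in range(1, k + 1):
        for removed in combinations(range(len(points)), r):
            kept = [p for i, p in enumerate(points) if i not in removed]
            models.append(fit_line(kept))
    return models


def naive_recourse(points):
    """Cheapest target value under the current model only."""
    return boundary(fit_line(points)) + EPS


def robust_recourse(points, k):
    """Cheapest target value that stays approved after any deletion of up to k points."""
    candidates = [fit_line(points)] + models_after_deletion(points, k)
    return max(boundary(m) for m in candidates) + EPS


current = fit_line(DATA)
retrained = models_after_deletion(DATA, 1)
naive = naive_recourse(DATA)
robust = robust_recourse(DATA, 1)
print("naive recourse survives all single deletions:",
      all(is_approved(m, naive) for m in retrained))
print("robust recourse survives all single deletions:",
      all(is_approved(m, robust) for m in retrained))
```

In this sketch the robust target lies further from the decision boundary than the naive one, so it asks more of the individual. That extra distance is the kind of cost the abstract refers to when it speaks of robustness "with bounded costs".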