Counterfactual Explanations for Deep Two-Sample Testing

Two-sample testing is a fundamental tool for detecting distributional differences across scientific domains, but classical tests (including kernel-based tests) can be ineffective on high-dimensional structured data such as images. Recent deep two-sample tests improve sensitivity in these settings by learning informative representations, yet they provide limited insight into which data features drive rejection of the null hypothesis $H_0$. To address this issue, we propose a counterfactual explanation framework for deep two-sample testing that generates sample-level edits moving observations from a source group toward a target group while explicitly reducing the discrepancy measured by the test. Our method combines a diffusion autoencoder with a pretrained deep two-sample test model and optimizes a maximum mean discrepancy (MMD) objective in the test model's representation space to produce plausible counterfactuals. We quantify distribution-level effects through changes in the test statistic and the resulting two-sample p-values. We evaluate the method on synthetic 2D shape datasets and two MRI cohorts. Across both settings, the counterfactual transformations consistently increase p-values relative to the original samples, indicating that the edited source set becomes statistically closer to the target distribution under the test. We measure minimality using LPIPS to ensure the counterfactuals remain close to the original samples. The resulting edits provide interpretable evidence of the features associated with the detected group differences. On MRI, the localized changes are consistent with known anatomical differences between cohorts.
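As a rough illustration of how a change in the test statistic and the two-sample p-value can be quantified, the following sketch computes a biased MMD estimate with a Gaussian kernel and a permutation p-value. It is not the method described above: it works on raw 2D features rather than a deep test model's representation space, the kernel, the fixed `bandwidth`, and the function names are assumptions made for illustration, and a simple mean shift stands in for the diffusion-based counterfactual edits.

```python
import numpy as np

def gaussian_kernel(a, b, bandwidth):
    # Pairwise squared Euclidean distances between rows of a and rows of b.
    sq = (np.sum(a**2, axis=1)[:, None]
          + np.sum(b**2, axis=1)[None, :]
          - 2.0 * a @ b.T)
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def mmd2_biased(x, y, bandwidth=1.0):
    # Biased (V-statistic) estimate of the squared MMD between samples x and y.
    return (gaussian_kernel(x, x, bandwidth).mean()
            + gaussian_kernel(y, y, bandwidth).mean()
            - 2.0 * gaussian_kernel(x, y, bandwidth).mean())

def permutation_p_value(x, y, bandwidth=1.0, n_perm=200, seed=0):
    # Permutation test: reshuffle group labels and count how often the
    # permuted statistic is at least as large as the observed one.
    rng = np.random.default_rng(seed)
    observed = mmd2_biased(x, y, bandwidth)
    pooled = np.vstack([x, y])
    n = len(x)
    exceed = 0
    for _ in range(n_perm):
        idx = rng.permutation(len(pooled))
        stat = mmd2_biased(pooled[idx[:n]], pooled[idx[n:]], bandwidth)
        exceed += int(stat >= observed)
    # Add-one correction keeps the p-value strictly positive.
    return (exceed + 1) / (n_perm + 1)

# Toy data (assumption): the source group is a shifted copy of the target law.
rng = np.random.default_rng(42)
target = rng.normal(0.0, 1.0, size=(100, 2))
source = rng.normal(0.0, 1.0, size=(100, 2)) + 2.0
# Stand-in "edit": remove the known shift, moving the source toward the target.
edited = source - 2.0

p_before = permutation_p_value(source, target)
p_after = permutation_p_value(edited, target)
print(f"p-value before edit: {p_before:.3f}, after edit: {p_after:.3f}")
```

In this toy setting the edited source set is drawn from the same distribution as the target, so the p-value after the edit is expected to be no smaller than before; with real counterfactuals the size of the increase would depend on the test model and the edits.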
