Conditional Generative Models are Sufficient to Sample from Any Causal Effect Estimand

Causal inference from observational data plays critical role in many applications in trustworthy machine learning. While sound and complete algorithms exist to compute causal effects, many of them assume access to conditional likelihoods, which is difficult to estimate for high-dimensional (particularly image) data. Researchers have alleviated this issue by simulating causal relations with neural models. However, when we have high-dimensional variables in the causal graph along with some unobserved confounders, no existing work can effectively sample from the un/conditional interventional distributions. In this work, we show how to sample from any identifiable interventional distribution given an arbitrary causal graph through a sequence of push-forward computations of conditional generative models, such as diffusion models. Our proposed algorithm follows the recursive steps of the existing likelihood-based identification algorithms to train a set of feed-forward models, and connect them in a specific way to sample from the desired distribution. We conduct experiments on a Colored MNIST dataset having both the treatment ($X$) and the target variables ($Y$) as images and sample from $P(y|do(x))$. Our algorithm also enables us to conduct a causal analysis to evaluate spurious correlations among input features of generative models pre-trained on the CelebA dataset. Finally, we generate high-dimensional interventional samples from the MIMIC-CXR dataset involving text and image variables.

翻译：从观测数据中进行因果推断在可信机器学习领域的许多应用中发挥着关键作用。虽然存在完备的算法来计算因果效应，但其中许多算法假设能够获取条件似然，这对于高维（尤其是图像）数据而言难以估计。研究人员通过使用神经模型模拟因果关系来缓解此问题。然而，当因果图中存在高维变量以及部分未观测混杂因子时，现有方法均无法有效从未/条件干预分布中采样。本研究表明，通过一系列条件生成模型（如扩散模型）的前向映射计算，如何从任意给定因果图的可识别干预分布中采样。所提算法遵循现有基于似然的识别算法的递归步骤，训练一组前馈模型，并以特定方式连接它们以从目标分布中采样。我们在一个彩色MNIST数据集上进行实验，该数据集的治疗变量（$X$）与目标变量（$Y$）均为图像，并从$P(y|do(x))$中采样。该算法还使我们能够进行因果分析，以评估在CelebA数据集上预训练的生成模型输入特征间的伪相关性。最后，我们从涉及文本和图像变量的MIMIC-CXR数据集中生成了高维干预样本。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日