We address Proxy Causal Learning (PCL), which aims to estimate causal effects from observed data in the presence of hidden confounding. Proxy methods accomplish this using two proxy variables related to the latent confounder: a treatment proxy (related to the treatment) and an outcome proxy (related to the outcome). Two approaches have been proposed for causal effect estimation given proxy variables; however, only one of these has found mainstream acceptance, since the other was understood to require density ratio estimation, a challenging task in high dimensions. In the present work, we propose a practical and effective implementation of the second approach, which bypasses explicit density ratio estimation and is suitable for continuous and high-dimensional treatments. We employ kernel ridge regression to derive estimators, resulting in simple closed-form solutions for dose-response and conditional dose-response curves, along with consistency guarantees. Empirically, our methods perform better than or comparably to existing frameworks on synthetic and real-world datasets.
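To illustrate what a "simple closed-form solution" means for kernel ridge regression in general, here is a minimal sketch of a generic single-stage regression. It is not the proxy estimator proposed in this work: the Gaussian kernel, the `n * lam` scaling of the regularizer, the hyperparameter values, and the helper names `rbf_kernel`, `krr_fit`, and `krr_predict` are all assumptions made for illustration.

```python
import numpy as np

def rbf_kernel(X, Z, lengthscale=1.0):
    # Gaussian (RBF) kernel matrix between the rows of X and the rows of Z.
    sq = (np.sum(X**2, axis=1)[:, None]
          + np.sum(Z**2, axis=1)[None, :]
          - 2.0 * X @ Z.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * lengthscale**2))

def krr_fit(X, y, lam=1e-3, lengthscale=1.0):
    # Closed-form ridge solution: alpha = (K + n * lam * I)^{-1} y.
    n = X.shape[0]
    K = rbf_kernel(X, X, lengthscale)
    return np.linalg.solve(K + n * lam * np.eye(n), y)

def krr_predict(X_train, alpha, X_new, lengthscale=1.0):
    # Prediction is a kernel-weighted combination of the fitted coefficients.
    return rbf_kernel(X_new, X_train, lengthscale) @ alpha

# Toy data (illustrative only): a noisy sine curve on [-3, 3].
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(200, 1))
y = np.sin(X[:, 0]) + 0.1 * rng.standard_normal(200)

alpha = krr_fit(X, y)
X_grid = np.linspace(-2.5, 2.5, 50)[:, None]
pred = krr_predict(X, alpha, X_grid)
```

The only numerical work is a single linear solve against the regularized kernel matrix, which is why estimators built from such regressions can be written in closed form.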