Prediction decomposition for causal analysis - 专知论文

会员服务 ·

0

分析 · 因果分析 · 分解 · 单元 · 反事实 ·

Prediction decomposition for causal analysis

翻译：预测分解用于因果分析

from arxiv, 22 pages, 7 figures

There is rising interest in using Machine Learning (ML) model predictions as outcomes in causal analysis. However, these methods have faced challenges in finding the true treatment effects. It is also challenging to make choices about which prediction models to choose, since we are interested not only in the accuracy of the prediction but in its ability to produce the correct causal effect in the analysis. In this paper I propose a decomposition of the prediction into between-unit prediction ($η_μ$), within-unit-across-time prediction ($η_ε$), and counterfactual-treatment-effect prediction ($η_T$). I show that the counterfactual-treatment-effect component is the one that determines whether the model recovers the true treatment effect, but only the first two components can be estimated from non-experimental data. I argue that within-unit-across-time prediction accuracy ($η_ε$) is a structurally better proxy for the counterfactual-treatment-effect component ($η_T$) than overall prediction accuracy, and propose a metric to estimate it from panel data with at least two time periods. This metric serves as a diagnostic and model-selection tool for choosing ML models for causal analysis. Under the stronger assumption that $η_T \approx η_ε$, it also enables constructing an approximately unbiased estimate of the treatment effect. I develop the theoretical framework and illustrate it with simulations of synthetic data.

翻译：机器学习（ML）模型预测作为因果分析中的结果变量正引起越来越多的关注。然而，这些方法在寻找真实处理效应方面面临挑战。由于我们不仅关注预测的准确性，更关注其在分析中产生正确因果效应的能力，因此选择合适的预测模型也颇具挑战性。本文提出将预测分解为单元间预测（η_μ）、单元内跨时间预测（η_ε）以及反事实处理效应预测（η_T）。我证明反事实处理效应分量是决定模型能否恢复真实处理效应的关键，但仅有前两个分量可从非实验数据中估计。我提出单元内跨时间预测精度（η_ε）在结构上比总体预测精度更优地作为反事实处理效应分量（η_T）的代理指标，并基于至少两个时间周期的面板数据设计出相应的估计指标。该指标可作为为因果分析选择ML模型的诊断与模型选择工具。在η_T ≈ η_ε这一更强假设下，它还能构建近似无偏的处理效应估计量。我建立了理论框架并通过合成数据模拟予以验证。

0

相关内容

【牛津博士论文】大规模观测因果机器学习中的结构与统计不确定性

【牛津博士论文】大规模观测因果机器学习中的结构与统计不确定性

专知会员服务

29+阅读 · 2024年9月29日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

【MIT博士论文】机器学习与因果关系:建立高效、可靠的决策模型

【MIT博士论文】机器学习与因果关系:建立高效、可靠的决策模型

专知会员服务

112+阅读 · 2022年7月10日

253页PPT！《因果性Causality》教程，哥本哈根大学Jonas Peters讲授

253页PPT！《因果性Causality》教程，哥本哈根大学Jonas Peters讲授

专知会员服务

99+阅读 · 2022年7月7日

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

专知会员服务

108+阅读 · 2022年4月28日

自然语言处理中的因果推理:估计、预测、解释和超越

专知会员服务

94+阅读 · 2021年9月5日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

110+阅读 · 2021年8月27日

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

「因果推理」概述论文，13页pdf

专知会员服务

101+阅读 · 2021年3月20日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

专知

70+阅读 · 2019年12月4日

用深度学习揭示数据的因果关系

用深度学习揭示数据的因果关系

专知

28+阅读 · 2019年5月18日

让智能体主动交互，DeepMind提出用元强化学习实现因果推理

让智能体主动交互，DeepMind提出用元强化学习实现因果推理

机器之心

17+阅读 · 2019年2月11日

15款免费预测分析软件！收藏好，别丢了！

15款免费预测分析软件！收藏好，别丢了！

七月在线实验室

11+阅读 · 2018年2月27日

回归预测&时间序列预测

回归预测&时间序列预测

GBASE数据工程部数据团队

44+阅读 · 2017年5月17日

贝叶斯网分解理论及其应用

国家自然科学基金

9+阅读 · 2017年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

高维回归模型的预测稳定性研究

国家自然科学基金

3+阅读 · 2015年12月31日

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于2-D空间离散数据的质量与产出的预测方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

概率图模型学习及其在数据分析中的应用研究

国家自然科学基金

16+阅读 · 2013年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

因果推断及不完全数据的统计分析

国家自然科学基金

23+阅读 · 2008年12月31日

Causal Variance Decompositions for Measuring Health Inequalities

Arxiv

0+阅读 · 4月24日

Symbolic Quantile Regression for the Interpretable Prediction of Conditional Quantiles

Arxiv

0+阅读 · 4月21日

Causal Meta-Analysis: Rethinking the Foundations of Evidence-Based Medicine

Arxiv

0+阅读 · 4月20日

Integrating Causal Machine Learning into Clinical Decision Support Systems: Insights from Literature and Practice

Arxiv

0+阅读 · 4月16日

Intervening to Learn and Compose Causally Disentangled Representations

Arxiv

0+阅读 · 4月2日

CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

Arxiv

0+阅读 · 3月26日

Algorithms with Calibrated Machine Learning Predictions

Arxiv

0+阅读 · 3月25日

Demystifying amortized causal discovery with transformers

Arxiv

0+阅读 · 3月18日

A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

Arxiv

13+阅读 · 2023年11月2日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

1+阅读 · 41分钟前

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

1+阅读 · 43分钟前

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

2+阅读 · 今天14:30

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

2+阅读 · 今天14:05

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

2+阅读 · 今天13:55

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

2+阅读 · 今天13:51

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

2+阅读 · 今天13:48

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

7+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

20+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

【牛津博士论文】大规模观测因果机器学习中的结构与统计不确定性

【牛津博士论文】大规模观测因果机器学习中的结构与统计不确定性

专知会员服务

29+阅读 · 2024年9月29日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

【MIT博士论文】机器学习与因果关系:建立高效、可靠的决策模型

【MIT博士论文】机器学习与因果关系:建立高效、可靠的决策模型

专知会员服务

112+阅读 · 2022年7月10日

253页PPT！《因果性Causality》教程，哥本哈根大学Jonas Peters讲授

253页PPT！《因果性Causality》教程，哥本哈根大学Jonas Peters讲授

专知会员服务

99+阅读 · 2022年7月7日

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

专知会员服务

108+阅读 · 2022年4月28日

自然语言处理中的因果推理:估计、预测、解释和超越

专知会员服务

94+阅读 · 2021年9月5日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

110+阅读 · 2021年8月27日

【MPG & MILA 】因果表示学习，Towards Causal Representation Learning

专知会员服务

52+阅读 · 2021年7月29日

「因果推理」概述论文，13页pdf

专知会员服务

101+阅读 · 2021年3月20日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

金融时序预测中的深度学习方法综述: 从2005到2019，附63页pdf下载

专知

70+阅读 · 2019年12月4日

用深度学习揭示数据的因果关系

用深度学习揭示数据的因果关系

专知

28+阅读 · 2019年5月18日

让智能体主动交互，DeepMind提出用元强化学习实现因果推理

让智能体主动交互，DeepMind提出用元强化学习实现因果推理

机器之心

17+阅读 · 2019年2月11日

15款免费预测分析软件！收藏好，别丢了！

15款免费预测分析软件！收藏好，别丢了！

七月在线实验室

11+阅读 · 2018年2月27日

回归预测&时间序列预测

回归预测&时间序列预测

GBASE数据工程部数据团队

44+阅读 · 2017年5月17日

相关论文

Causal Variance Decompositions for Measuring Health Inequalities

Arxiv

0+阅读 · 4月24日

Symbolic Quantile Regression for the Interpretable Prediction of Conditional Quantiles

Arxiv

0+阅读 · 4月21日

Causal Meta-Analysis: Rethinking the Foundations of Evidence-Based Medicine

Arxiv

0+阅读 · 4月20日

Integrating Causal Machine Learning into Clinical Decision Support Systems: Insights from Literature and Practice

Arxiv

0+阅读 · 4月16日

Intervening to Learn and Compose Causally Disentangled Representations

Arxiv

0+阅读 · 4月2日

CausalPre: Scalable and Effective Data Pre-Processing for Causal Fairness

Arxiv

0+阅读 · 3月26日

Algorithms with Calibrated Machine Learning Predictions

Arxiv

0+阅读 · 3月25日

Demystifying amortized causal discovery with transformers

Arxiv

0+阅读 · 3月18日

A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

Arxiv

13+阅读 · 2023年11月2日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

相关基金

贝叶斯网分解理论及其应用

国家自然科学基金

9+阅读 · 2017年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

高维回归模型的预测稳定性研究

国家自然科学基金

3+阅读 · 2015年12月31日

方差正则化的分类模型选择方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于2-D空间离散数据的质量与产出的预测方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

概率图模型学习及其在数据分析中的应用研究

国家自然科学基金

16+阅读 · 2013年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

因果推断及不完全数据的统计分析

国家自然科学基金

23+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员