Sample size and power calculations for causal inference of observational studies - 专知论文

会员服务 ·

0

样本 · 推断 · 得分 · 因果推断 · 关联 ·

Sample size and power calculations for causal inference of observational studies

翻译：观察性研究中因果推断的样本量与功效计算理论

Bo Liu,Chengxin Yang,Fan Li

This paper investigates the theoretical foundation and develops analytical formulas for sample size and power calculations for causal inference with observational data. By analyzing the variance of an inverse probability weighting estimator of the average treatment effect, we decompose the power calculation into three components: propensity score distribution, potential outcome distribution, and their correlation. We show that to determine the minimal sample size of an observational study, in addition to the standard inputs in the power calculation of randomized trials, it is sufficient to have two parameters, which quantify the strength of the confounder-treatment and the confounder-outcome association, respectively. For the former, we propose using the Bhattacharyya coefficient, which measures the covariate overlap and, together with the treatment proportion, leads to a uniquely identifiable and easily computable propensity score distribution. For the latter, we propose a sensitivity parameter bounded by the R-squared statistic of the regression of the outcome on covariates. Our procedure relies on a parametric propensity score model and a semiparametric restricted mean outcome model, but does not require distributional assumptions on the multivariate covariates. We develop an associated R package PSpower.

翻译：本文研究了基于观察性数据进行因果推断时样本量与功效计算的理论基础，并推导了相应的解析公式。通过分析平均处理效应的逆概率加权估计量的方差，我们将功效计算分解为三个组成部分：倾向得分分布、潜在结果分布及其相关性。研究表明，为确定观察性研究所需的最小样本量，除随机试验功效计算中的标准输入参数外，仅需两个分别量化混杂因素-处理关联强度和混杂因素-结果关联强度的参数即可。对于前者，我们提出使用Bhattacharyya系数来度量协变量重叠度，该系数与处理比例共同构成唯一可识别且易于计算的倾向得分分布。对于后者，我们提出以结果对协变量回归的R平方统计量作为边界约束的敏感性参数。本方法基于参数化倾向得分模型与半参数化限制平均结果模型，但无需对多元协变量进行分布假设。我们同步开发了相应的R软件包PSpower。

0

相关内容

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

【牛津大学博士论文】用于因果学习与推理的可处理概率模型，240页pdf

【牛津大学博士论文】用于因果学习与推理的可处理概率模型，240页pdf

专知会员服务

46+阅读 · 2023年9月20日

【MIT博士论文】非参数因果推理的算法方法，424页pdf

【MIT博士论文】非参数因果推理的算法方法，424页pdf

专知会员服务

84+阅读 · 2022年9月20日

因果如何用于推荐？清华等最新《推荐系统中的因果推理》综述论文，29页pdf阐述因果推荐方法体系

因果如何用于推荐？清华等最新《推荐系统中的因果推理》综述论文，29页pdf阐述因果推荐方法体系

专知会员服务

48+阅读 · 2022年8月31日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

【经典书】统计学中的因果推断，156页pdf

【经典书】统计学中的因果推断，156页pdf

专知会员服务

98+阅读 · 2022年6月14日

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

专知会员服务

29+阅读 · 2022年4月28日

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

专知会员服务

108+阅读 · 2022年4月28日

最新《从观察数据发现因果性》，150页ppt

专知会员服务

66+阅读 · 2021年1月6日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

专知

12+阅读 · 2022年11月25日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

因果推理学习算法资源大列表

因果推理学习算法资源大列表

专知

27+阅读 · 2019年3月3日

哈佛大学Miguel Hernan科学家最新2019年《因果推断:概念与方法》书稿终版，280页讲解因果效应（附下载）

哈佛大学Miguel Hernan科学家最新2019年《因果推断:概念与方法》书稿终版，280页讲解因果效应（附下载）

专知

77+阅读 · 2019年1月3日

北京大学何洋波博士《因果推断和因果图模型》机器学习报告

北京大学何洋波博士《因果推断和因果图模型》机器学习报告

专知

103+阅读 · 2018年11月11日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

机器学习研究会

19+阅读 · 2018年3月11日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

基于复杂数据的回归模型统计推断及其应用

国家自然科学基金

3+阅读 · 2015年12月31日

随机波动率模型的统计推断及数值解

国家自然科学基金

1+阅读 · 2015年12月31日

基于部分核实数据的统计推断及应用

国家自然科学基金

0+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

基于似然函数的统计推断

国家自然科学基金

5+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

因果推断及不完全数据的统计分析

国家自然科学基金

23+阅读 · 2008年12月31日

Estimation and Inference for Causal Explainability

Arxiv

0+阅读 · 3月6日

Robust Power and Sample Size Calculations in Quasi-likelihood Models: Methods and Practice

Arxiv

0+阅读 · 2月28日

Design-based inference for generalized causal effects in randomized experiments

Arxiv

0+阅读 · 2月20日

General sample size analysis for probabilities of causation: a delta method approach

Arxiv

0+阅读 · 2月19日

Sample size and power determination for assessing overall SNP effects in joint modeling of longitudinal and time-to-event data

Arxiv

0+阅读 · 2月16日

Simulating the Power of Statistical Tests: A Collection of R Examples

Arxiv

0+阅读 · 2月13日

Modern Causal Inference Approaches to Improve Power for Subgroup Analysis in Randomized Controlled Trials

Arxiv

0+阅读 · 2月11日

Estimating causal effects of functional treatments with modified functional treatment policies

Arxiv

0+阅读 · 2月9日

Identification and Debiased Learning of Causal Effects with General Instrumental Variables

Arxiv

0+阅读 · 2月7日

Optimal Sample Splitting for Observational Studies

Arxiv

0+阅读 · 1月30日

VIP会员

文章信息

相关主题

最新内容

BES：让语言模型通过双向进化搜索自我改进

BES：让语言模型通过双向进化搜索自我改进

专知会员服务

0+阅读 · 36分钟前

ICML 2026 | 揭开视觉语言模型计数瓶颈：看得到，却说不出

ICML 2026 | 揭开视觉语言模型计数瓶颈：看得到，却说不出

专知会员服务

0+阅读 · 37分钟前

以色列-美国-伊朗战争中的无人机：关键要点

以色列-美国-伊朗战争中的无人机：关键要点

专知会员服务

3+阅读 · 今天14:04

美以伊战争：首次人工智能战争——军事自主性困境

美以伊战争：首次人工智能战争——军事自主性困境

专知会员服务

3+阅读 · 今天13:54

《Palantir任务保障性软件安全标准（MA-S2）》

《Palantir任务保障性软件安全标准（MA-S2）》

专知会员服务

6+阅读 · 今天13:49

《美海军利用扩展现实增强知识流动研究》300页报告

《美海军利用扩展现实增强知识流动研究》300页报告

专知会员服务

4+阅读 · 今天13:38

基于声学的无人机检测技术综述

基于声学的无人机检测技术综述

专知会员服务

5+阅读 · 今天13:37

《当代混合战争分析框架：俄乌战争经验教训》

《当代混合战争分析框架：俄乌战争经验教训》

专知会员服务

5+阅读 · 今天13:11

生成式AI基础小册子绪论解读：一条数学地基路线，178页pdf

生成式AI基础小册子绪论解读：一条数学地基路线，178页pdf

专知会员服务

10+阅读 · 5月29日

AutoScientists：自组织智能体团队驱动长期科学实验

AutoScientists：自组织智能体团队驱动长期科学实验

专知会员服务

5+阅读 · 5月29日

《阿利·伯克级驱逐舰的战损修理：桌面推演结果》报告

《阿利·伯克级驱逐舰的战损修理：桌面推演结果》报告

专知会员服务

5+阅读 · 5月29日

战略前沿人工智能的再思考（中文）

战略前沿人工智能的再思考（中文）

专知会员服务

7+阅读 · 5月29日

《量化地基防空系统间接效应的博弈论方法》

《量化地基防空系统间接效应的博弈论方法》

专知会员服务

5+阅读 · 5月29日

传感器网络：美国如何探测来自伊朗的导弹与无人机

传感器网络：美国如何探测来自伊朗的导弹与无人机

专知会员服务

6+阅读 · 5月29日

《无人机战争中的经济不对称：伊朗“沙赫德-136”对抗以色列“铁穹”防御系统的案例研究》

《无人机战争中的经济不对称：伊朗“沙赫德-136”对抗以色列“铁穹”防御系统的案例研究》

专知会员服务

8+阅读 · 5月29日

相关VIP内容

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

【牛津大学博士论文】用于因果学习与推理的可处理概率模型，240页pdf

【牛津大学博士论文】用于因果学习与推理的可处理概率模型，240页pdf

专知会员服务

46+阅读 · 2023年9月20日

【MIT博士论文】非参数因果推理的算法方法，424页pdf

【MIT博士论文】非参数因果推理的算法方法，424页pdf

专知会员服务

84+阅读 · 2022年9月20日

因果如何用于推荐？清华等最新《推荐系统中的因果推理》综述论文，29页pdf阐述因果推荐方法体系

因果如何用于推荐？清华等最新《推荐系统中的因果推理》综述论文，29页pdf阐述因果推荐方法体系

专知会员服务

48+阅读 · 2022年8月31日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

【经典书】统计学中的因果推断，156页pdf

【经典书】统计学中的因果推断，156页pdf

专知会员服务

98+阅读 · 2022年6月14日

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

专知会员服务

29+阅读 · 2022年4月28日

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

因果如何用于推荐？中科大最新WWW2022《因果推荐: 进展与未来方向》教程，附123页ppt

专知会员服务

108+阅读 · 2022年4月28日

最新《从观察数据发现因果性》，150页ppt

专知会员服务

66+阅读 · 2021年1月6日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

热门VIP内容

开通专知VIP会员享更多权益服务

ICML 2026 | 揭开视觉语言模型计数瓶颈：看得到，却说不出

美以伊战争：首次人工智能战争——军事自主性困境

BES：让语言模型通过双向进化搜索自我改进

以色列-美国-伊朗战争中的无人机：关键要点

相关资讯

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

专知

12+阅读 · 2022年11月25日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

因果推理学习算法资源大列表

因果推理学习算法资源大列表

专知

27+阅读 · 2019年3月3日

哈佛大学Miguel Hernan科学家最新2019年《因果推断:概念与方法》书稿终版，280页讲解因果效应（附下载）

哈佛大学Miguel Hernan科学家最新2019年《因果推断:概念与方法》书稿终版，280页讲解因果效应（附下载）

专知

77+阅读 · 2019年1月3日

北京大学何洋波博士《因果推断和因果图模型》机器学习报告

北京大学何洋波博士《因果推断和因果图模型》机器学习报告

专知

103+阅读 · 2018年11月11日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

机器学习研究会

19+阅读 · 2018年3月11日

相关论文

Estimation and Inference for Causal Explainability

Arxiv

0+阅读 · 3月6日

Robust Power and Sample Size Calculations in Quasi-likelihood Models: Methods and Practice

Arxiv

0+阅读 · 2月28日

Design-based inference for generalized causal effects in randomized experiments

Arxiv

0+阅读 · 2月20日

General sample size analysis for probabilities of causation: a delta method approach

Arxiv

0+阅读 · 2月19日

Sample size and power determination for assessing overall SNP effects in joint modeling of longitudinal and time-to-event data

Arxiv

0+阅读 · 2月16日

Simulating the Power of Statistical Tests: A Collection of R Examples

Arxiv

0+阅读 · 2月13日

Modern Causal Inference Approaches to Improve Power for Subgroup Analysis in Randomized Controlled Trials

Arxiv

0+阅读 · 2月11日

Estimating causal effects of functional treatments with modified functional treatment policies

Arxiv

0+阅读 · 2月9日

Identification and Debiased Learning of Causal Effects with General Instrumental Variables

Arxiv

0+阅读 · 2月7日

Optimal Sample Splitting for Observational Studies

Arxiv

0+阅读 · 1月30日

相关基金

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

基于复杂数据的回归模型统计推断及其应用

国家自然科学基金

3+阅读 · 2015年12月31日

随机波动率模型的统计推断及数值解

国家自然科学基金

1+阅读 · 2015年12月31日

基于部分核实数据的统计推断及应用

国家自然科学基金

0+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

基于似然函数的统计推断

国家自然科学基金

5+阅读 · 2014年12月31日

含有隐变量的因果结构学习与统计因果推断

国家自然科学基金

21+阅读 · 2013年12月31日

因果推断的统计方法

国家自然科学基金

26+阅读 · 2011年12月31日

因果推断及不完全数据的统计分析

国家自然科学基金

23+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员