Detecting Where Effects Occur by Testing Hypotheses in Order - 专知论文

会员服务 ·

0

试验 · 多站点 · 统计显著性 · 显著性 · 包含 ·

Detecting Where Effects Occur by Testing Hypotheses in Order

翻译：基于假设顺序检验的效应发生位置检测

Jake Bowers,David Kim,Nuole Chen

Experimental evaluations of public policies often randomize a new intervention within many sites or blocks. After a report of an overall result -- statistically significant or not -- the natural question from a policy maker is: \emph{where} did any effects occur? Standard adjustments for multiple testing provide little power to answer this question. In simulations modeled after a 44-block education trial, the Hommel adjustment -- among the most powerful procedures controlling the family-wise error rate (FWER) -- detects effects in only 11\% of truly non-null blocks. We develop a procedure that tests hypotheses top-down through a tree: test the overall null at the root, then groups of blocks, then individual blocks, stopping any branch where the null is not rejected. In the same 44-block design, this approach detects effects in 44\% of non-null blocks -- roughly four times the detection rate. A stopping rule and valid tests at each node suffice for weak FWER control. We show that the strong-sense FWER depends on how rejection probabilities accumulate along paths through the tree. This yields a diagnostic: when power decays fast enough relative to branching, no adjustment is needed; otherwise, an adaptive $α$-adjustment restores control. We apply the method to 25 MDRC education trials and provide an R package, \texttt{manytestsr}.

翻译：公共政策的实验评估通常在许多站点或区块内随机实施新干预措施。在报告总体结果（无论是否具有统计显著性）后，政策制定者自然会提出这样的问题：效应究竟发生在\emph{何处}？传统的多重检验校正方法对此问题的检测功效有限。在以一项包含44个区块的教育试验为模型的模拟中，Hommel校正——作为控制族错误率（FWER）功效最强的程序之一——仅在11%的真实非零效应区块中检测到效应。本研究开发了一种通过树结构自上而下检验假设的程序：在根节点检验整体零假设，随后检验区块组假设，最后检验单个区块假设，并在零假设未被拒绝的任何分支处停止检验。在相同的44区块设计中，该方法在44%的非零效应区块中检测到效应——检测率提升约四倍。每个节点的停止规则与有效检验足以实现弱FWER控制。我们证明强FWER控制取决于拒绝概率沿树路径的累积方式。由此推导出诊断准则：当检验功效相对于分支衰减足够快时，无需进行校正；否则，自适应$α$调整可恢复控制。我们将该方法应用于25项MDRC教育试验，并提供了R语言包\texttt{manytestsr}。

0

相关内容

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

专知会员服务

30+阅读 · 2023年4月29日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

因果机器学习模型-核方法:治疗效果、反事实、中介和代理，附72页ppt与视频

因果机器学习模型-核方法:治疗效果、反事实、中介和代理，附72页ppt与视频

专知会员服务

47+阅读 · 2022年7月17日

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

专知会员服务

29+阅读 · 2022年4月28日

带核的因果模型:治疗效果，反事实，调解，和代理，57页ppt

带核的因果模型:治疗效果，反事实，调解，和代理，57页ppt

专知会员服务

31+阅读 · 2022年2月21日

因果关联学习，Causal Relational Learning

因果关联学习，Causal Relational Learning

专知会员服务

185+阅读 · 2020年4月21日

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

专知会员服务

17+阅读 · 2020年4月15日

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

专知会员服务

34+阅读 · 2020年3月4日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

【Google AI新论文EfficientDet】规模化高效化的物体检测，EfficientDet: Scalable and Efficient Object Detection(附pdf)

【Google AI新论文EfficientDet】规模化高效化的物体检测，EfficientDet: Scalable and Efficient Object Detection(附pdf)

专知会员服务

27+阅读 · 2019年11月24日

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

AB实验在滴滴数据驱动中的应用

AB实验在滴滴数据驱动中的应用

DataFunTalk

15+阅读 · 2020年5月31日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

论文浅尝 | 时序与因果关系联合推理

论文浅尝 | 时序与因果关系联合推理

开放知识图谱

36+阅读 · 2019年6月23日

最新最权威《深度学习显著目标检测综述》论文代码数据发布，带你全面了解显著目标检测方法

最新最权威《深度学习显著目标检测综述》论文代码数据发布，带你全面了解显著目标检测方法

专知

79+阅读 · 2019年4月24日

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

开放知识图谱

16+阅读 · 2018年12月2日

原创 | Attention Modeling for Targeted Sentiment

原创 | Attention Modeling for Targeted Sentiment

黑龙江大学自然语言处理实验室

25+阅读 · 2017年11月5日

回归预测&时间序列预测

回归预测&时间序列预测

GBASE数据工程部数据团队

44+阅读 · 2017年5月17日

侦测欺诈交易（异常点检测）

侦测欺诈交易（异常点检测）

GBASE数据工程部数据团队

20+阅读 · 2017年5月10日

多重假设检验中的k-FWER控制

国家自然科学基金

0+阅读 · 2015年12月31日

处理效应差异中位数的有效估计

国家自然科学基金

0+阅读 · 2015年12月31日

半参数回归模型中随机误差分布的检验问题

国家自然科学基金

2+阅读 · 2015年12月31日

基于时序相似性的机场噪声监测点交互预测

国家自然科学基金

1+阅读 · 2015年12月31日

随机对策的首达目标准则及其有限逼近

国家自然科学基金

0+阅读 · 2015年12月31日

试验设计中的模型选择

国家自然科学基金

6+阅读 · 2014年12月31日

问责机制何以奏效？面向公共部门政策执行的实证研究

国家自然科学基金

1+阅读 · 2014年12月31日

面向微博数据的位置相关事件检测和时空异常聚类模式挖掘研究

国家自然科学基金

0+阅读 · 2014年12月31日

多重比较中控制FDR的有效检验方法

国家自然科学基金

0+阅读 · 2014年12月31日

劣者淘汰两阶段自适应临床试验的设计和分析

国家自然科学基金

0+阅读 · 2014年12月31日

Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing

Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing

Arxiv

0+阅读 · 3月17日

Post-Experiment Decisions: The Dual Adjustments for Rollout and Downstream Optimizations

Arxiv

0+阅读 · 3月11日

Predictive Power Analysis of Multiple Test Procedures Under Arbitrary Dependence

Arxiv

0+阅读 · 3月7日

Testing Full Mediation of Treatment Effects and the Identifiability of Causal Mechanisms

Arxiv

0+阅读 · 3月4日

Detecting Where Effects Occur by Testing Hypotheses in Order

Arxiv

0+阅读 · 2月24日

Safe hypotheses testing with application to order restricted inference

Arxiv

0+阅读 · 2月17日

Anytime-Valid Inference in Adaptive Experiments: Covariate Adjustment and Balanced Power

Arxiv

0+阅读 · 2月13日

Modern Causal Inference Approaches to Improve Power for Subgroup Analysis in Randomized Controlled Trials

Arxiv

0+阅读 · 2月11日

Consistency Assessment of Regional Treatment Effect for Multi-Regional Clinical Trials in the Presence of Covariate Shift

Arxiv

0+阅读 · 2月7日

Rank-Learner: Orthogonal Ranking of Treatment Effects

Arxiv

0+阅读 · 2月3日

VIP会员

文章信息

相关主题

统计显著性

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

3+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

4+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

6+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

6+阅读 · 6月22日

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

8+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

21+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

专知会员服务

30+阅读 · 2023年4月29日

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

最新《因果推断导论》，51页ppt，剑桥大学助理教授Qingyuan Zhao讲解

专知会员服务

41+阅读 · 2022年8月28日

因果机器学习模型-核方法:治疗效果、反事实、中介和代理，附72页ppt与视频

因果机器学习模型-核方法:治疗效果、反事实、中介和代理，附72页ppt与视频

专知会员服务

47+阅读 · 2022年7月17日

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

《异构观测数据中的联合因果推理》美国艾莫利大学、微软、约翰霍普金斯大学、哈佛大学、斯坦福大学等联合发表最新论文63页PDF

专知会员服务

29+阅读 · 2022年4月28日

带核的因果模型:治疗效果，反事实，调解，和代理，57页ppt

带核的因果模型:治疗效果，反事实，调解，和代理，57页ppt

专知会员服务

31+阅读 · 2022年2月21日

因果关联学习，Causal Relational Learning

因果关联学习，Causal Relational Learning

专知会员服务

185+阅读 · 2020年4月21日

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

【ACL2020】生成事实验证解释，Generating Fact Checking Explanations

专知会员服务

17+阅读 · 2020年4月15日

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

【斯坦福大学】Dropout的隐性和显性正则化效应，Regularization Effects

专知会员服务

34+阅读 · 2020年3月4日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

【Google AI新论文EfficientDet】规模化高效化的物体检测，EfficientDet: Scalable and Efficient Object Detection(附pdf)

【Google AI新论文EfficientDet】规模化高效化的物体检测，EfficientDet: Scalable and Efficient Object Detection(附pdf)

专知会员服务

27+阅读 · 2019年11月24日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

【KDD2020-Tutorial】因果推理与稳定学习，Causal Inference and Stable Learning

专知

11+阅读 · 2020年8月28日

AB实验在滴滴数据驱动中的应用

AB实验在滴滴数据驱动中的应用

DataFunTalk

15+阅读 · 2020年5月31日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

论文浅尝 | 时序与因果关系联合推理

论文浅尝 | 时序与因果关系联合推理

开放知识图谱

36+阅读 · 2019年6月23日

最新最权威《深度学习显著目标检测综述》论文代码数据发布，带你全面了解显著目标检测方法

最新最权威《深度学习显著目标检测综述》论文代码数据发布，带你全面了解显著目标检测方法

专知

79+阅读 · 2019年4月24日

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

论文浅尝 | 基于局内去噪和迁移学习的关系抽取

开放知识图谱

16+阅读 · 2018年12月2日

原创 | Attention Modeling for Targeted Sentiment

原创 | Attention Modeling for Targeted Sentiment

黑龙江大学自然语言处理实验室

25+阅读 · 2017年11月5日

回归预测&时间序列预测

回归预测&时间序列预测

GBASE数据工程部数据团队

44+阅读 · 2017年5月17日

侦测欺诈交易（异常点检测）

侦测欺诈交易（异常点检测）

GBASE数据工程部数据团队

20+阅读 · 2017年5月10日

相关论文

Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing

Conditional Distributional Treatment Effects: Doubly Robust Estimation and Testing

Arxiv

0+阅读 · 3月17日

Post-Experiment Decisions: The Dual Adjustments for Rollout and Downstream Optimizations

Arxiv

0+阅读 · 3月11日

Predictive Power Analysis of Multiple Test Procedures Under Arbitrary Dependence

Arxiv

0+阅读 · 3月7日

Testing Full Mediation of Treatment Effects and the Identifiability of Causal Mechanisms

Arxiv

0+阅读 · 3月4日

Detecting Where Effects Occur by Testing Hypotheses in Order

Arxiv

0+阅读 · 2月24日

Safe hypotheses testing with application to order restricted inference

Arxiv

0+阅读 · 2月17日

Anytime-Valid Inference in Adaptive Experiments: Covariate Adjustment and Balanced Power

Arxiv

0+阅读 · 2月13日

Modern Causal Inference Approaches to Improve Power for Subgroup Analysis in Randomized Controlled Trials

Arxiv

0+阅读 · 2月11日

Consistency Assessment of Regional Treatment Effect for Multi-Regional Clinical Trials in the Presence of Covariate Shift

Arxiv

0+阅读 · 2月7日

Rank-Learner: Orthogonal Ranking of Treatment Effects

Arxiv

0+阅读 · 2月3日

相关基金

多重假设检验中的k-FWER控制

国家自然科学基金

0+阅读 · 2015年12月31日

处理效应差异中位数的有效估计

国家自然科学基金

0+阅读 · 2015年12月31日

半参数回归模型中随机误差分布的检验问题

国家自然科学基金

2+阅读 · 2015年12月31日

基于时序相似性的机场噪声监测点交互预测

国家自然科学基金

1+阅读 · 2015年12月31日

随机对策的首达目标准则及其有限逼近

国家自然科学基金

0+阅读 · 2015年12月31日

试验设计中的模型选择

国家自然科学基金

6+阅读 · 2014年12月31日

问责机制何以奏效？面向公共部门政策执行的实证研究

国家自然科学基金

1+阅读 · 2014年12月31日

面向微博数据的位置相关事件检测和时空异常聚类模式挖掘研究

国家自然科学基金

0+阅读 · 2014年12月31日

多重比较中控制FDR的有效检验方法

国家自然科学基金

0+阅读 · 2014年12月31日

劣者淘汰两阶段自适应临床试验的设计和分析

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员