Design-Based Anytime-Valid Inference for Randomized Experiments with Delayed Outcomes and Staggered Entry - 专知论文

会员服务 ·

0

置信度 · 估计/估计量 · 估计误差 · 推断 · 在线 ·

Design-Based Anytime-Valid Inference for Randomized Experiments with Delayed Outcomes and Staggered Entry

翻译：基于设计的含延迟结果与交错进入的随机实验可随时推断的有效性

Michael Lindon,Nathan Kallus

Delayed outcomes are ubiquitous in online experimentation: treatment can affect whether an outcome occurs, when it occurs, and its realized value. To accommodate staggered entry while remaining robust to environmental nonstationarity and unit-level heterogeneity, we adopt a design-based perspective and target the sample cumulative reward in each arm as a function of calendar time. Our confidence sequences allow practitioners to continuously monitor the counterfactual incremental reward, such as revenue, that would have been realized by calendar time $t$ had all entered units been assigned to treatment rather than control. The main technical challenge is the choice of design-based filtration, complicated by the presence of asynchronous potential outcome times. We show that the IPW treatment-effect estimation error is not a martingale with respect to any filtration, while each arm-specific IPW estimation error is a martingale with respect to a carefully chosen arm-specific event-time filtration. We therefore construct a confidence sequence for the treatment effect by combining two arm-level confidence sequences with a union bound, and further demonstrate that this can outperform the traditional design-based variance upper bound. Finally, we characterize the class of augmentations for which the per-arm AIPW estimation error remains a martingale.

翻译：延迟结果在在线实验中普遍存在：处理可能影响结果是否发生、发生的时间及其实现值。为适应交错进入同时保持对环境非平稳性和单位异质性的鲁棒性，我们采用基于设计的视角，将各臂的样本累积奖励作为日历时间的函数。我们的置信序列使从业者能够连续监测反事实增量奖励（例如收入），即在日历时间 $t$ 时，若所有已进入单位被分配至处理组而非对照组时将实现的奖励。主要技术挑战在于基于设计的过滤选择，由于存在异步潜在结果时间而变得复杂。我们证明，IPW处理效应估计误差相对于任何过滤均不是鞅，而各臂特定的IPW估计误差相对于精心选择的各臂特定事件时间过滤是鞅。因此，我们通过结合两个臂级置信序列与联合界，构建了处理效应的置信序列，并进一步证明这可以优于传统的基于设计方差上界。最后，我们描述了使每臂AIPW估计误差保持为鞅的增强类。

0

相关内容

置信度

【EPFL博士论文】因果推断的方法学进展：实验、识别与估计

【EPFL博士论文】因果推断的方法学进展：实验、识别与估计

专知会员服务

16+阅读 · 2025年11月5日

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

索邦大学121页博士论文《时间序列中的无监督异常检测》

索邦大学121页博士论文《时间序列中的无监督异常检测》

专知会员服务

104+阅读 · 2022年7月25日

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

专知会员服务

84+阅读 · 2022年7月20日

【ICML2022】用神经控制微分方程建立反事实结果的连续时间模型

【ICML2022】用神经控制微分方程建立反事实结果的连续时间模型

专知会员服务

35+阅读 · 2022年6月24日

反事实学习如何用于推荐！看RecSys2021教程《推荐系统反事实学习和评估:基础、实施和最新进展》，

专知会员服务

35+阅读 · 2021年9月30日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

110+阅读 · 2021年8月27日

清华大学刘云新获MobiSys 2021 最佳论文奖：精准预测深度学习模型在边缘设备上的推理延迟

专知会员服务

33+阅读 · 2021年7月17日

【ICML2020投稿论文-DeepMind】时序差分学习的推理与泛化，Temporal Difference Learning

专知会员服务

26+阅读 · 2020年3月16日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

专知

12+阅读 · 2022年11月25日

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

转化率预估(pCVR)系列--延迟预估模型（上篇）

转化率预估(pCVR)系列--延迟预估模型（上篇）

AINLP

31+阅读 · 2020年6月1日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

论文浅尝 | 时序与因果关系联合推理

论文浅尝 | 时序与因果关系联合推理

开放知识图谱

36+阅读 · 2019年6月23日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

时序异常检测算法概览

时序异常检测算法概览

论智

29+阅读 · 2018年8月30日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

带跳随机时滞微分方程解的高效快速算法设计及其在美式未定权益定价中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

半参数回归模型中随机误差分布的检验问题

国家自然科学基金

2+阅读 · 2015年12月31日

数据中心延迟敏感型应用尾端响应时延服务质量保障方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

分数阶时滞随机微分方程中的随机共振现象与行为研究

国家自然科学基金

0+阅读 · 2015年12月31日

事件触发机制下随机多智能体系统的有限时间一致性研究

国家自然科学基金

2+阅读 · 2015年12月31日

稳健随机均值模型在时空数据分析中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

超线性增长条件下的混杂型随机时滞微分方程

国家自然科学基金

0+阅读 · 2014年12月31日

随机延迟微分方程数值解的延迟依赖稳定性及自适应技术

国家自然科学基金

0+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

Causal Inference with Missing Exposures and Missing Outcomes

Arxiv

0+阅读 · 6月16日

Detecting Where Effects Occur by Testing Hypotheses in Order

Arxiv

0+阅读 · 6月13日

Existence Precedes Value: Joint Modeling of Observational Existence and Evolving States in Time Series Forecasting

Arxiv

0+阅读 · 6月11日

Learning to Bet for Horizon-Aware Anytime-Valid Testing

Arxiv

0+阅读 · 6月2日

Adaptive clinical trial design with delayed treatment effects using elicited prior distributions

Arxiv

0+阅读 · 6月1日

Anytime-valid testing with e-values and confirmatory adaptive designs

Arxiv

0+阅读 · 5月30日

Debiased inference for stochastic treatment interventions with survival outcomes

Arxiv

0+阅读 · 5月29日

Bayesian Estimation of Cohort-Time-Stratum Specific Effects in Staggered Difference-in-Differences

Arxiv

0+阅读 · 5月21日

Evaluating the impact of outcome delay on the efficiency of sample size re-estimation

Arxiv

0+阅读 · 5月12日

Statistical Design of Pragmatic Trials Using Electronic Health Record Data when Outcome Assessments are Uncontrolled and Irregular

Arxiv

0+阅读 · 5月8日

VIP会员

文章信息

相关主题

估计/估计量

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

4+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

6+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

6+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

6+阅读 · 6月22日

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

8+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

22+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

【EPFL博士论文】因果推断的方法学进展：实验、识别与估计

【EPFL博士论文】因果推断的方法学进展：实验、识别与估计

专知会员服务

16+阅读 · 2025年11月5日

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

索邦大学121页博士论文《时间序列中的无监督异常检测》

索邦大学121页博士论文《时间序列中的无监督异常检测》

专知会员服务

104+阅读 · 2022年7月25日

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

专知会员服务

84+阅读 · 2022年7月20日

【ICML2022】用神经控制微分方程建立反事实结果的连续时间模型

【ICML2022】用神经控制微分方程建立反事实结果的连续时间模型

专知会员服务

35+阅读 · 2022年6月24日

反事实学习如何用于推荐！看RecSys2021教程《推荐系统反事实学习和评估:基础、实施和最新进展》，

专知会员服务

35+阅读 · 2021年9月30日

因果推断，Causal Inference：The Mixtape

因果推断，Causal Inference：The Mixtape

专知会员服务

110+阅读 · 2021年8月27日

清华大学刘云新获MobiSys 2021 最佳论文奖：精准预测深度学习模型在边缘设备上的推理延迟

专知会员服务

33+阅读 · 2021年7月17日

【ICML2020投稿论文-DeepMind】时序差分学习的推理与泛化，Temporal Difference Learning

专知会员服务

26+阅读 · 2020年3月16日

最新「因果推断Causal Inference」综述论文38页pdf，Buffalo、Georgia、阿里巴巴、Virginia

专知会员服务

183+阅读 · 2020年2月11日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

《因果性与机器学习综述》2022最新40页报告，美国陆军研究实验室

专知

12+阅读 · 2022年11月25日

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

【干货书】基于统计和机器学习的实用时间序列分析预测，Time Series Analysis Prediction

专知

18+阅读 · 2022年4月9日

「因果推理」概述论文，13页pdf

「因果推理」概述论文，13页pdf

专知

16+阅读 · 2021年3月20日

基于深度元学习的因果推断新方法

基于深度元学习的因果推断新方法

图与推荐

12+阅读 · 2020年7月21日

转化率预估(pCVR)系列--延迟预估模型（上篇）

转化率预估(pCVR)系列--延迟预估模型（上篇）

AINLP

31+阅读 · 2020年6月1日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

论文浅尝 | 时序与因果关系联合推理

论文浅尝 | 时序与因果关系联合推理

开放知识图谱

36+阅读 · 2019年6月23日

相关性≠因果：概率图模型和do-calculus

相关性≠因果：概率图模型和do-calculus

论智

31+阅读 · 2018年10月29日

时序异常检测算法概览

时序异常检测算法概览

论智

29+阅读 · 2018年8月30日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Causal Inference with Missing Exposures and Missing Outcomes

Arxiv

0+阅读 · 6月16日

Detecting Where Effects Occur by Testing Hypotheses in Order

Arxiv

0+阅读 · 6月13日

Existence Precedes Value: Joint Modeling of Observational Existence and Evolving States in Time Series Forecasting

Arxiv

0+阅读 · 6月11日

Learning to Bet for Horizon-Aware Anytime-Valid Testing

Arxiv

0+阅读 · 6月2日

Adaptive clinical trial design with delayed treatment effects using elicited prior distributions

Arxiv

0+阅读 · 6月1日

Anytime-valid testing with e-values and confirmatory adaptive designs

Arxiv

0+阅读 · 5月30日

Debiased inference for stochastic treatment interventions with survival outcomes

Arxiv

0+阅读 · 5月29日

Bayesian Estimation of Cohort-Time-Stratum Specific Effects in Staggered Difference-in-Differences

Arxiv

0+阅读 · 5月21日

Evaluating the impact of outcome delay on the efficiency of sample size re-estimation

Arxiv

0+阅读 · 5月12日

Statistical Design of Pragmatic Trials Using Electronic Health Record Data when Outcome Assessments are Uncontrolled and Irregular

Arxiv

0+阅读 · 5月8日

相关基金

带跳随机时滞微分方程解的高效快速算法设计及其在美式未定权益定价中的应用

国家自然科学基金

0+阅读 · 2015年12月31日

半参数回归模型中随机误差分布的检验问题

国家自然科学基金

2+阅读 · 2015年12月31日

数据中心延迟敏感型应用尾端响应时延服务质量保障方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

分数阶时滞随机微分方程中的随机共振现象与行为研究

国家自然科学基金

0+阅读 · 2015年12月31日

事件触发机制下随机多智能体系统的有限时间一致性研究

国家自然科学基金

2+阅读 · 2015年12月31日

稳健随机均值模型在时空数据分析中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

相依回归模型与扩散过程的统计推断及其应用

国家自然科学基金

1+阅读 · 2014年12月31日

超线性增长条件下的混杂型随机时滞微分方程

国家自然科学基金

0+阅读 · 2014年12月31日

随机延迟微分方程数值解的延迟依赖稳定性及自适应技术

国家自然科学基金

0+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员