Evaluating the impact of outcome delay on the efficiency of sample size re-estimation - 专知论文

会员服务 ·

0

样本 · 试验 · 设计 · 工具 · 参数不确定性 ·

Evaluating the impact of outcome delay on the efficiency of sample size re-estimation

翻译：评估结果延迟对样本量重估计效率的影响

Aritra Mukherjee,Michael J Grayling,James J M S Wason

Sample size reestimation can be a powerful tool to ensure that a clinical trial meets its prespecified power requirements when uncertainty regarding a design parameter exists at the planning stage. However, long term primary endpoints can be harmful to the efficiency of this trial design. If recruitment is continued while treatment outcomes are awaited, long delay can potentially lead to a large number of pipeline participants being recruited in the trial that do not contribute to the interim analysis. This may lead to a larger number of recruited participants than are actually deemed required, resulting in an overpowered trial with high cost. This paper studies the exact impact of such outcome delay on the efficiency of internal pilot type SSR designs. The distribution of the final sample size post SSR is obtained under various delay lengths for both continuous and binary outcome data, how delay impacts the precision of the final sample size estimate is then discussed. Precisely, the impact of delay on this precision is assessed through RMSE, as well as two more novel metrics, termed the delay impact and cost. The results indicate that with increase in delay length, the delay impact increases, inflating average sample size and power. However, the severity of the effect of delayed outcomes depends highly on the exact trial setting. Trials where the reestimated sample size is smaller than originally planned suffer the most from delayed outcomes, often leading to an overpowered trial. However, the impact of delay is substantially less if the original planned sample size remains smaller than the reestimated sample size.

翻译：样本量重估计是一种强有力的工具，可确保在规划阶段存在设计参数不确定性时，临床试验满足预设的检验效能要求。然而，长期主要终点可能损害此类试验设计的效率。若在等待治疗结果期间继续招募受试者，长期延迟可能导致试验中招募大量未纳入期中分析的流水线参与者，使得实际招募人数超出实际所需，从而产生检验效能过高且成本高昂的试验。本文研究了这种结果延迟对内部预试验型SSR设计效率的确切影响。针对连续型和二分类结局数据，推导了不同延迟时长下SSR后最终样本量的分布，进而讨论了延迟如何影响最终样本量估计的精确度。具体而言，通过均方根误差以及两种新的衡量指标（称为延迟影响和成本）评估延迟对该精确度的作用。结果表明，随着延迟时长增加，延迟影响增大，导致平均样本量和检验效能膨胀。但延迟结果的严重程度高度依赖于具体的试验设置。当重估计样本量小于原计划时，试验受延迟结果影响最大，往往导致检验效能过高。然而，若原计划样本量仍小于重估计样本量，则延迟的影响显著降低。

0

相关内容

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

事件抽取的再评价:过去、现在和未来的挑战

事件抽取的再评价:过去、现在和未来的挑战

专知会员服务

25+阅读 · 2023年11月28日

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

专知会员服务

30+阅读 · 2023年4月29日

【CVPR2023】正则化二阶影响的持续学习

【CVPR2023】正则化二阶影响的持续学习

专知会员服务

19+阅读 · 2023年4月22日

小样本目标检测研究综述

小样本目标检测研究综述

专知会员服务

57+阅读 · 2023年1月15日

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

专知会员服务

84+阅读 · 2022年7月20日

【清华大学】Delta调优:预训练语言模型参数有效方法的综合研究，Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

【清华大学】Delta调优:预训练语言模型参数有效方法的综合研究，Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

专知会员服务

26+阅读 · 2022年3月15日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【帝国理工学院】医疗影像中「因果性」至关重要，Glocker这52页ppt讲述医疗机器学习因果性

【帝国理工学院】医疗影像中「因果性」至关重要，Glocker这52页ppt讲述医疗机器学习因果性

专知会员服务

51+阅读 · 2020年3月15日

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

专知会员服务

20+阅读 · 2020年1月7日

【NIPS2019】Infidelity and Sensitivity：模型可解释性方法的定量评估

【NIPS2019】Infidelity and Sensitivity：模型可解释性方法的定量评估

AINLP

19+阅读 · 2020年6月14日

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

专知

12+阅读 · 2020年6月9日

转化率预估(pCVR)系列--延迟预估模型（上篇）

转化率预估(pCVR)系列--延迟预估模型（上篇）

AINLP

31+阅读 · 2020年6月1日

小样本也能增量学习？CVPR 2020 Oral最新干货：小样本类增量学习

小样本也能增量学习？CVPR 2020 Oral最新干货：小样本类增量学习

CVer

54+阅读 · 2020年5月1日

【WWW2020-新加坡国立大学】知识图谱强化负采样的推荐系统，Reinforced Negative Sampling

【WWW2020-新加坡国立大学】知识图谱强化负采样的推荐系统，Reinforced Negative Sampling

专知

22+阅读 · 2020年3月14日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

从 ICLR 2019 一览小样本学习最新进展！

从 ICLR 2019 一览小样本学习最新进展！

AI科技评论

15+阅读 · 2019年6月9日

深度 | 推荐系统评估

深度 | 推荐系统评估

AI100

24+阅读 · 2019年3月16日

入门 | 什么是最大似然估计、最大后验估计以及贝叶斯参数估计

入门 | 什么是最大似然估计、最大后验估计以及贝叶斯参数估计

机器之心

11+阅读 · 2018年4月15日

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

机器学习研究会

19+阅读 · 2018年3月11日

云计算平台中大规模交互式服务长尾延迟消减关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

大型复杂医学领域本体质量评估理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

处理效应差异中位数的有效估计

国家自然科学基金

0+阅读 · 2015年12月31日

网络本体质量及适应性的评估研究

国家自然科学基金

0+阅读 · 2015年12月31日

数据中心延迟敏感型应用尾端响应时延服务质量保障方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

样本特性对海洋遥感产品真实性检验的定量化影响研究

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下约束线性模型的有偏估计及变量选择研究

国家自然科学基金

0+阅读 · 2014年12月31日

随机延迟微分方程数值解的延迟依赖稳定性及自适应技术

国家自然科学基金

0+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

Too Few or Too Many? Sample Size Estimation for Differential Abundance Studies

Arxiv

0+阅读 · 6月15日

Empirical stratification for treatment effect heterogeneity with post-treatment variables

Arxiv

0+阅读 · 6月9日

Orthogonal Learner for Estimating Heterogeneous Long-Term Treatment Effects

Arxiv

0+阅读 · 6月3日

Improving Longitudinal Targeted Maximum Likelihood Estimation in Target Trial Emulation using Joint Calibrated Weights

Arxiv

0+阅读 · 6月3日

Adaptive clinical trial design with delayed treatment effects using elicited prior distributions

Arxiv

0+阅读 · 6月1日

Evaluating causal indirect effects when mediators are left-censored by assay limit of quantification

Arxiv

0+阅读 · 5月30日

Design-Based Anytime-Valid Inference for Randomized Experiments with Delayed Outcomes and Staggered Entry

Arxiv

0+阅读 · 5月29日

Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments

Arxiv

0+阅读 · 5月29日

Joint Estimation of Marginal and Heterogeneous Treatment Effects

Arxiv

0+阅读 · 5月22日

Sample size and power calculations for causal inference with time-to-event outcomes

Arxiv

0+阅读 · 5月11日

VIP会员

文章信息

相关主题

参数不确定性

最新内容

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

2+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

2+阅读 · 6月18日

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

专知会员服务

8+阅读 · 6月18日

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

专知会员服务

6+阅读 · 6月18日

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

ICML 2026 | FR3D：解耦自车运动的未来动态三维重建世界模型

专知会员服务

4+阅读 · 6月17日

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

【伯克利博士论文】迈向可扩展与自我演进的大语言模型智能体

专知会员服务

6+阅读 · 6月17日

学习数据的几何：形状空间分析数学综述

学习数据的几何：形状空间分析数学综述

专知会员服务

6+阅读 · 6月17日

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

《现代防空系统综述：架构、传感器、拦截器及新兴威胁环境对基础设施受限防御环境的影响》2026最新长综述

专知会员服务

8+阅读 · 6月17日

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

7+阅读 · 6月17日

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

4+阅读 · 6月17日

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

6+阅读 · 6月17日

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

7+阅读 · 6月17日

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

专知会员服务

5+阅读 · 6月17日

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

专知会员服务

5+阅读 · 6月17日

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

专知会员服务

6+阅读 · 6月16日

相关VIP内容

基于因果推断的推荐系统去偏研究

基于因果推断的推荐系统去偏研究

专知会员服务

21+阅读 · 2024年11月10日

事件抽取的再评价:过去、现在和未来的挑战

事件抽取的再评价:过去、现在和未来的挑战

专知会员服务

25+阅读 · 2023年11月28日

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

【AISTATS2023】基于上下文和混杂因素的因果效应估计，77页ppt

专知会员服务

30+阅读 · 2023年4月29日

【CVPR2023】正则化二阶影响的持续学习

【CVPR2023】正则化二阶影响的持续学习

专知会员服务

19+阅读 · 2023年4月22日

小样本目标检测研究综述

小样本目标检测研究综述

专知会员服务

57+阅读 · 2023年1月15日

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

【ICML2022】因果Transformer:估算反事实结果的因果, 附ppt

专知会员服务

84+阅读 · 2022年7月20日

【清华大学】Delta调优:预训练语言模型参数有效方法的综合研究，Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

【清华大学】Delta调优:预训练语言模型参数有效方法的综合研究，Delta Tuning: A Comprehensive Study of Parameter Efficient Methods for Pre-trained Language Models

专知会员服务

26+阅读 · 2022年3月15日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

【帝国理工学院】医疗影像中「因果性」至关重要，Glocker这52页ppt讲述医疗机器学习因果性

【帝国理工学院】医疗影像中「因果性」至关重要，Glocker这52页ppt讲述医疗机器学习因果性

专知会员服务

51+阅读 · 2020年3月15日

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

【独立研究者I-Sheng Yang论文】因果机器学习损失函数（A Loss-Function for Causal Machine-Learning）

专知会员服务

20+阅读 · 2020年1月7日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

《面向反无人机作战的联邦式可解释射频–光电/红外情报融合：边缘人工智能优化、电子战韧性及分布式监视验证》

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

《廉价自杀式无人机战争的军事战略影响：乌克兰和伊朗案例研究》

相关资讯

【NIPS2019】Infidelity and Sensitivity：模型可解释性方法的定量评估

【NIPS2019】Infidelity and Sensitivity：模型可解释性方法的定量评估

AINLP

19+阅读 · 2020年6月14日

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

【CVPR2020-北京大学】自适应间隔损失的提升小样本学习

专知

12+阅读 · 2020年6月9日

转化率预估(pCVR)系列--延迟预估模型（上篇）

转化率预估(pCVR)系列--延迟预估模型（上篇）

AINLP

31+阅读 · 2020年6月1日

小样本也能增量学习？CVPR 2020 Oral最新干货：小样本类增量学习

小样本也能增量学习？CVPR 2020 Oral最新干货：小样本类增量学习

CVer

54+阅读 · 2020年5月1日

【WWW2020-新加坡国立大学】知识图谱强化负采样的推荐系统，Reinforced Negative Sampling

【WWW2020-新加坡国立大学】知识图谱强化负采样的推荐系统，Reinforced Negative Sampling

专知

22+阅读 · 2020年3月14日

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

最新「因果推断Causal Inference」综述论文38页pdf，阿里巴巴、Buffalo、Georgia、Virginia

专知

68+阅读 · 2020年2月11日

从 ICLR 2019 一览小样本学习最新进展！

从 ICLR 2019 一览小样本学习最新进展！

AI科技评论

15+阅读 · 2019年6月9日

深度 | 推荐系统评估

深度 | 推荐系统评估

AI100

24+阅读 · 2019年3月16日

入门 | 什么是最大似然估计、最大后验估计以及贝叶斯参数估计

入门 | 什么是最大似然估计、最大后验估计以及贝叶斯参数估计

机器之心

11+阅读 · 2018年4月15日

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

【机器学习基本理论】详解最大似然估计（MLE）、最大后验概率估计（MAP），以及贝叶斯公式的理解

机器学习研究会

19+阅读 · 2018年3月11日

相关论文

Too Few or Too Many? Sample Size Estimation for Differential Abundance Studies

Arxiv

0+阅读 · 6月15日

Empirical stratification for treatment effect heterogeneity with post-treatment variables

Arxiv

0+阅读 · 6月9日

Orthogonal Learner for Estimating Heterogeneous Long-Term Treatment Effects

Arxiv

0+阅读 · 6月3日

Improving Longitudinal Targeted Maximum Likelihood Estimation in Target Trial Emulation using Joint Calibrated Weights

Arxiv

0+阅读 · 6月3日

Adaptive clinical trial design with delayed treatment effects using elicited prior distributions

Arxiv

0+阅读 · 6月1日

Evaluating causal indirect effects when mediators are left-censored by assay limit of quantification

Arxiv

0+阅读 · 5月30日

Design-Based Anytime-Valid Inference for Randomized Experiments with Delayed Outcomes and Staggered Entry

Arxiv

0+阅读 · 5月29日

Modeling Covariate Transition for Efficient Estimation of Longitudinal Treatment Effects in Randomized Experiments

Arxiv

0+阅读 · 5月29日

Joint Estimation of Marginal and Heterogeneous Treatment Effects

Arxiv

0+阅读 · 5月22日

Sample size and power calculations for causal inference with time-to-event outcomes

Arxiv

0+阅读 · 5月11日

相关基金

云计算平台中大规模交互式服务长尾延迟消减关键技术研究

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下部分线性模型有约束统计推断理论

国家自然科学基金

2+阅读 · 2015年12月31日

大型复杂医学领域本体质量评估理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

处理效应差异中位数的有效估计

国家自然科学基金

0+阅读 · 2015年12月31日

网络本体质量及适应性的评估研究

国家自然科学基金

0+阅读 · 2015年12月31日

数据中心延迟敏感型应用尾端响应时延服务质量保障方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

样本特性对海洋遥感产品真实性检验的定量化影响研究

国家自然科学基金

0+阅读 · 2015年12月31日

测量误差数据下约束线性模型的有偏估计及变量选择研究

国家自然科学基金

0+阅读 · 2014年12月31日

随机延迟微分方程数值解的延迟依赖稳定性及自适应技术

国家自然科学基金

0+阅读 · 2014年12月31日

概率抽样设计及其统计推断方法

国家自然科学基金

6+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员