Propensity Score Propagation: A General Framework for Design-Based Inference with Unknown Propensity Scores

Design-based inference, also known as randomization-based or finite-population inference, provides a principled framework for causal and descriptive analyses that attribute randomness solely to the design mechanism (e.g., treatment assignment, sampling, or missingness) without imposing distributional or modeling assumptions on the outcome data of study units. Despite its conceptual appeal and long history, this framework becomes challenging to apply when the underlying design probabilities (i.e., propensity scores) are unknown, as is common in observational studies, real-world surveys, and missing-data settings. Existing plug-in or matching-based approaches either ignore the uncertainty stemming from estimated propensity scores or rely on the post-matching uniform-propensity condition (an assumption typically violated when there are multiple or continuous covariates), leading to systematic under-coverage. Finite-population M-estimation partially mitigates these issues but remains limited to parametric propensity score models. In this work, we introduce propensity score propagation, a general framework for valid design-based inference with unknown propensity scores. The framework introduces a regeneration-and-union procedure that automatically propagates uncertainty in propensity score estimation into downstream design-based inference. It accommodates both parametric and nonparametric propensity score models, integrates seamlessly with standard tools in design-based inference with known propensity scores, and is universally applicable to various important design-based inference problems, such as observational studies, real-world surveys, and missing-data analyses, among many others. Simulation studies demonstrate that the proposed framework restores nominal coverage levels in settings where conventional methods suffer from severe under-coverage.
