How should researchers conduct causal inference when the outcome of interest is latent and measured imperfectly by multiple indicators? We develop a general nonparametric framework for identifying and estimating average treatment effects on latent outcomes in randomized experiments. We show that latent-outcome estimation faces two distinct noncomparability challenges. First, across studies, different measurement systems may cause estimators to target different empirical quantities even when the underlying latent treatment effect is the same. Second, within a study, different indicators may have different and possibly nonlinear relationships with the same latent outcome, making them not directly comparable. To address these challenges, we propose a design-based approach built around nonparametric bridge functions. We show that these bridge functions can be characterized and identified. Estimation relies on a debiasing procedure that permits valid inference even when the bridge functions are weakly identified. Simulations demonstrate that standard methods, such as principal components analysis and inverse covariance weighting, can generate spurious cross-study differences, whereas our approach recovers comparable latent treatment effects. Overall, the framework provides both a general strategy for causal inference with latent outcomes and practical guidance for designing measurements that support identification, comparability, and efficient estimation.
翻译:当感兴趣的结果变量为潜变量且通过多个指标不完美测量时,研究者应如何进行因果推断?本文针对随机实验中的潜变量结果,提出一个通用的非参数框架,用于识别和估计其平均处理效应。研究表明,潜变量的估计面临两个不同的不可比挑战:第一,在跨研究情境下,即便潜在的真实潜变量处理效应相同,不同的测量系统也可能导致估计量瞄准不同的经验量;第二,在同一研究中,不同指标与同一潜变量可能存在不同甚至非线性的关系,使其无法直接比较。为应对这些挑战,我们提出一种基于非参数桥函数的设计导向方法,证明这些桥函数可被刻画并识别。估计过程依赖于去偏程序,该程序即使在桥函数弱识别时也能保证有效推断。模拟结果表明:主成分分析与逆协方差加权等标准方法可能产生虚假的跨研究差异,而我们的方法则可恢复可比性的潜变量处理效应。总体而言,该框架既为潜变量结果的因果推断提供了通用策略,也为支持识别、可比性与高效估计的测量设计提供了实用指导。