Many standard estimators, when applied to adaptively collected data, fail to be asymptotically normal, thereby complicating the construction of confidence intervals. We address this challenge in a semi-parametric context: estimating the parameter vector of a generalized linear regression model contaminated by a non-parametric nuisance component. We construct suitably weighted estimating equations that account for adaptivity in data collection, and provide conditions under which the associated estimates are asymptotically normal. Our results characterize the degree of "explorability" required for asymptotic normality to hold. For the simpler problem of estimating a linear functional, we provide similar guarantees under much weaker assumptions. We illustrate our general theory with concrete consequences for various problems, including standard linear bandits and sparse generalized bandits, and compare with other methods via simulation studies.
翻译:许多标准估计量在应用于适应性收集数据时,无法实现渐近正态性,从而给置信区间的构建带来困难。我们在半参数背景下解决了这一挑战:估计受非参数干扰分量污染的广义线性回归模型的参数向量。我们构造了适当加权的估计方程,以解释数据收集中的适应性,并给出了相关估计量达到渐近正态性的条件。我们的结果刻画了实现渐近正态性所需的“可探索性”程度。对于更简单的线性泛函估计问题,我们在更弱的假设下给出了类似的保证。我们通过具体问题(包括标准线性bandits和稀疏广义bandits)的理论推导,以及与其他方法的仿真研究对比,阐释了我们的通用理论。