Methods for adjusting for covariate measurement error in flexible modelling of functional form: results of a blinded, controlled neutral comparison simulation study

翻译：协变量测量误差在灵活函数形式建模中的校正方法：一项盲法、受控中性比较模拟研究结果

Mohammed Sedki,Aris Perperoglou,Anne C. M. Thiébaut,Steve Ferreira Guerra,Paul Gustafson,Frank E. Harrell,Willi Sauerbrei,Michal Abrahamowicz,Laurence S. Freedman

Covariate measurement error is pervasive in epidemiological research and distorts estimated exposure-outcome associations, yet correction methods have been studied almost exclusively under linear modelling assumptions. Their behaviour when the underlying association is non-linear and is itself estimated with flexible regression, remains poorly characterised. We report a blinded, multi-stage neutral comparison simulation study, conducted within the STRATOS initiative, evaluating measurement error correction coupled with flexible modelling of functional form. Six families of correction methods (pointwise and coefficient-based Simulation Extrapolation [SIMEX], Bayesian inference on the logit and risk scales, Multiple Imputation [MI], and Regression Calibration [RC]) were each combined with B-splines (BS), penalised splines (PS), fractional polynomials (FP), and natural splines (NS), yielding 23 analytic methods. Methods were applied to case-control data generated under five functional forms (J-shape, linear, two threshold models, and saturation) across simulated datasets spanning varying sample sizes, replication substudy sizes, error magnitudes, and error distributions, with classical additive error and a replication substudy for error calibration. Performance was assessed by the log mean squared error of the estimated function over the central 95 % of the exposure distribution. Pointwise SIMEX was the most accurate and most robust approach overall, followed by Bayesian methods and RC when paired with PS, FP, or NS; MI performed less well, and Bayesian estimation with unpenalised BS performed worst. PS, FP, and NS were near-equivalent, whereas BS was consistently inferior. No single method dominated across all scenarios, underscoring the value of sensitivity analyses.

翻译：协变量测量误差在流行病学研究中普遍存在，并会导致暴露-结局关联估计的偏倚，然而校正方法几乎仅在线性建模假设下得到研究。当潜在关联呈非线性且其本身通过灵活回归进行估计时，这些方法的性能仍缺乏充分表征。我们报告了一项在STRATOS计划框架内开展的多阶段盲法中立的比较模拟研究，旨在评估结合灵活函数形式建模的测量误差校正方法。六类校正方法（逐点法与基于系数的模拟外推法[SIMEX]、在logit和风险尺度上的贝叶斯推断、多重插补法[MI]和回归校准法[RC]）分别与B样条（BS）、惩罚样条（PS）、分数多项式（FP）和自然样条（NS）组合，共形成23种分析方法。这些方法应用于五种函数形式（J形、线性、两种阈值模型和饱和模型）生成的病例-对照数据，模拟数据集涵盖不同样本量、重复子研究规模、误差幅度和误差分布，采用经典加性误差模型和重复子研究进行误差校准。通过暴露分布中间95%区间内估计函数的对数均方误差评估性能。逐点SIMEX在整体准确性和稳健性方面表现最优，其次为与PS、FP或NS联合使用的贝叶斯方法及RC；MI表现较差，而基于未惩罚BS的贝叶斯估计表现最差。PS、FP和NS性能近似，而BS始终处于劣势。在所有场景中，没有任何单一方法占绝对优势，这凸显了敏感性分析的重要性。