Distributionally Robust Transfer Learning with Structurally Missing Covariates, with Application to Cross-National Cardiac Arrest Prediction

Siqi Li,Chuan Hong,Ziye Tian,Benjamin Sieu-Hon Leong,Koshi Nakagawa,Hideharu Tanaka,Sang Do Shin,Khuong Quoc Dai,Do Ngoc Son,Marcus Eng Hock Ong,Nan Liu,Molei Liu

Deploying clinical prediction models across healthcare systems often fails when key training covariates are unavailable at deployment and labeled outcomes are limited in the target domain. For example, high-performing models for out-of-hospital cardiac arrest (OHCA) rely on detailed prehospital measurements routinely collected in high-resource settings but unavailable in many international registries. Existing methods either discard missing covariates, sacrificing predictive information, or rely on untestable assumptions about their target distribution. We propose DRUM (\underline{D}istributionally \underline{R}obust \underline{U}nsupervised transfer learning with structurally \underline{M}issing covariates), a framework that transfers prediction models to target populations where certain covariates are structurally absent and outcome labels are unavailable. DRUM partitions covariates into shared components ($X$), observed across all settings, and missing components ($A$), observed only in the source. Rather than imputing missing covariates, DRUM optimizes worst-case predictive performance over the unknown target distribution of $A \mid X$ using a neural network generator, with a robustness parameter controlling allowable deviation from the source conditional. We further develop a bias correction procedure that reduces sensitivity to nuisance estimation error. Simulations show substantial improvements in both mean and worst-case prediction error under distribution shift. Applied to cross-national OHCA prediction, transferring models from a US registry to multiple Asian registries where prehospital variables are unrecorded, DRUM yields better-calibrated predictions and improved clinical classification performance across sites.

翻译：在医疗系统间部署临床预测模型时，关键训练协变量在部署场景中不可用且目标域标签数据有限，往往导致模型失效。例如，院外心脏骤停（OHCA）的高性能模型依赖高资源环境中常规采集的详细院前测量数据，但这类数据在许多国际登记系统中缺失。现有方法或是直接丢弃缺失协变量从而损失预测信息，或是依赖关于目标分布不可验证的假设。本文提出DRUM（基于结构缺失协变量的分布鲁棒无监督迁移学习框架），该框架可将预测模型迁移至某些协变量结构性缺失且无标签数据的目标人群。DRUM将协变量分为共享组件（$X$，所有场景均可观测）与缺失组件（$A$，仅在源域可观测）。不同于插补缺失协变量，DRUM通过神经网络生成器优化未知目标分布$A \mid X$下的最差预测性能，并引入鲁棒性参数控制与源域条件分布的允许偏离程度。我们进一步开发了偏差校正流程以降低扰动估计误差的影响。模拟实验表明，在分布偏移下，该方法在平均预测误差和最差预测误差上均有显著改善。应用于跨国OHCA预测时（将美国登记系统模型迁移至未记录院前变量的多个亚洲登记系统），DRUM在不同站点均实现了更优校准的预测结果和更佳的临床分类性能。