Recently, a growing body of research has focused on either optimizing CTR model architectures to better model feature interactions or refining training objectives to aid parameter learning, thereby achieving better predictive performance. However, previous efforts have primarily focused on the training phase, largely neglecting opportunities for optimization during the inference phase. Infrequently occurring feature combinations, in particular, can degrade prediction performance, leading to unreliable or low-confidence outputs. To unlock the predictive potential of trained CTR models, we propose a Model-Agnostic Test-Time paradigm (MATT), which leverages the confidence scores of feature combinations to guide the generation of multiple inference paths, thereby mitigating the influence of low-confidence features on the final prediction. Specifically, to quantify the confidence of feature combinations, we introduce a hierarchical probabilistic hashing method to estimate the occurrence frequencies of feature combinations at various orders, which serve as their corresponding confidence scores. Then, using the confidence scores as sampling probabilities, we generate multiple instance-specific inference paths through iterative sampling and subsequently aggregate the prediction scores from multiple paths to conduct robust predictions. Finally, extensive offline experiments and online A/B tests strongly validate the compatibility and effectiveness of MATT across existing CTR models.
翻译:近年来,大量研究聚焦于优化点击率预测(CTR)模型架构以更好地建模特征交互,或改进训练目标以辅助参数学习,从而提升预测性能。然而,先前的工作主要关注训练阶段,在很大程度上忽略了推理阶段的优化机会。特别是,低频特征组合会降低预测性能,导致输出结果不可靠或置信度较低。为解锁已训练CTR模型的预测潜力,我们提出一种模型无关测试时范式(MATT),该方法利用特征组合的置信度分数引导生成多条推理路径,从而减轻低置信度特征对最终预测的影响。具体而言,为量化特征组合的置信度,我们引入层次化概率哈希方法,用于估计不同阶次特征组合的出现频率,并将其作为对应的置信度分数。随后,以置信度分数作为采样概率,通过迭代采样生成多个实例特定的推理路径,并聚合多条路径的预测分数以进行鲁棒预测。最后,大量离线实验与在线A/B测试充分验证了MATT在现有CTR模型上的兼容性与有效性。