Modern industrial recommendation systems improve recommendation performance by integrating multimodal representations from pre-trained models into ID-based Click-Through Rate (CTR) prediction frameworks. However, existing approaches typically adopt modality-centric modeling strategies that process ID-based and multimodal embeddings independently, failing to capture fine-grained interactions between content semantics and behavioral signals. In this paper, we propose Decoupled Multimodal Fusion (DMF), which introduces a modality-enriched modeling strategy to enable fine-grained interactions between ID-based collaborative representations and multimodal representations for user interest modeling. Specifically, we construct target-aware features to bridge the semantic gap across different embedding spaces and leverage them as side information to enhance the effectiveness of user interest modeling. Furthermore, we design an inference-optimized attention mechanism that decouples the computation of target-aware features and ID-based embeddings before the attention layer, thereby alleviating the computational bottleneck introduced by incorporating target-aware features. To achieve comprehensive multimodal integration, DMF combines user interest representations learned under the modality-centric and modality-enriched modeling strategies. Offline experiments on public and industrial datasets demonstrate the effectiveness of DMF. Moreover, DMF has been deployed on the product recommendation system of the international e-commerce platform Lazada, achieving relative improvements of 5.30% in CTCVR and 7.43% in GMV with negligible computational overhead.
翻译:现代工业推荐系统通过将预训练模型的多模态表示集成到基于ID的点击率(CTR)预测框架中,提升了推荐性能。然而,现有方法通常采用以模态为中心的建模策略,独立处理基于ID的嵌入和多模态嵌入,未能捕捉内容语义与行为信号之间的细粒度交互。在本文中,我们提出解耦多模态融合(DMF),该方法引入了一种模态增强建模策略,能够实现基于ID的协同表示与多模态表示之间的细粒度交互,以用于用户兴趣建模。具体而言,我们构建了目标感知特征以桥接不同嵌入空间之间的语义鸿沟,并将其作为辅助信息来增强用户兴趣建模的有效性。此外,我们设计了一种推理优化的注意力机制,在注意力层之前解耦目标感知特征和基于ID的嵌入的计算,从而缓解引入目标感知特征所带来的计算瓶颈。为实现全面的多模态融合,DMF结合了在以模态为中心和模态增强建模策略下学习到的用户兴趣表示。在公开和工业数据集上的离线实验证明了DMF的有效性。此外,DMF已部署在国际电商平台Lazada的商品推荐系统中,在计算开销可忽略的情况下,实现了CTCVR相对提升5.30%和GMV相对提升7.43%。