HoMer: Addressing Heterogeneities by Modeling Sequential and Set-wise Contexts for CTR Prediction

Click-through rate (CTR) prediction, which models behavior sequence and non-sequential features (e.g., user/item profiles or cross features) to infer user interest, underpins industrial recommender systems. However, most methods face three forms of heterogeneity that degrade predictive performance: (i) Feature Heterogeneity persists when limited sequence side features provide less granular interest representation compared to extensive non-sequential features, thereby impairing sequence modeling performance; (ii) Context Heterogeneity arises because a user's interest in an item will be influenced by other items, yet point-wise prediction neglects cross-item interaction context from the entire item set; (iii) Architecture Heterogeneity stems from the fragmented integration of specialized network modules, which compounds the model's effectiveness, efficiency and scalability in industrial deployments. To tackle the above limitations, we propose HoMer, a Homogeneous-Oriented TransforMer for modeling sequential and set-wise contexts. First, we align sequence side features with non-sequential features for accurate sequence modeling and fine-grained interest representation. Second, we shift the prediction paradigm from point-wise to set-wise, facilitating cross-item interaction in a highly parallel manner. Third, HoMer's unified encoder-decoder architecture achieves dual optimization through structural simplification and shared computation, ensuring computational efficiency while maintaining scalability with model size. Without arduous modification to the prediction pipeline, HoMer successfully scales up and outperforms our industrial baseline by 0.0099 in the AUC metric, and enhances online business metrics like CTR/RPM by 1.99%/2.46%. Additionally, HoMer saves 27% of GPU resources via preliminary engineering optimization, further validating its superiority and practicality.

翻译：点击率（CTR）预测通过建模行为序列与非序列特征（例如用户/物品画像或交叉特征）来推断用户兴趣，是工业推荐系统的基础。然而，大多数方法面临三种形式的异质性，导致预测性能下降：（i）特征异质性：当有限的序列侧特征相比丰富的非序列特征提供更粗糙的兴趣表示时，会损害序列建模性能；（ii）上下文异质性：用户对某物品的兴趣会受到其他物品的影响，但逐点预测忽略了整个物品集合中的跨物品交互上下文；（iii）架构异质性：源于专用网络模块的碎片化集成，这会损害模型在工业部署中的效果、效率与可扩展性。为应对上述局限，我们提出HoMer，一种面向同质化的Transformer，用于建模序列与集合上下文。首先，我们将序列侧特征与非序列特征对齐，以实现精确的序列建模和细粒度的兴趣表示。其次，我们将预测范式从逐点转向集合式，以高度并行的方式促进跨物品交互。第三，HoMer统一的编码器-解码器架构通过结构简化和计算共享实现双重优化，在保持模型规模可扩展性的同时确保计算效率。无需对预测流程进行复杂修改，HoMer成功实现规模化扩展，并在AUC指标上超越我们的工业基线0.0099，同时将在线业务指标如CTR/RPM提升了1.99%/2.46%。此外，通过初步的工程优化，HoMer节省了27%的GPU资源，进一步验证了其优越性与实用性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日