TWIN V2: Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at Kuaishou

Zihua Si,Lin Guan,ZhongXiang Sun,Xiaoxue Zang,Jing Lu,Yiqun Hui,Xingchao Cao,Zeyu Yang,Yichen Zheng,Dewei Leng,Kai Zheng,Chenbin Zhang,Yanan Niu,Yang Song,Kun Gai

from arxiv, Accepted by CIKM 2024

The significance of modeling long-term user interests for CTR prediction tasks in large-scale recommendation systems is progressively gaining attention among researchers and practitioners. Existing work, such as SIM and TWIN, typically employs a two-stage approach to model long-term user behavior sequences for efficiency concerns. The first stage rapidly retrieves a subset of sequences related to the target item from a long sequence using a search-based mechanism namely the General Search Unit (GSU), while the second stage calculates the interest scores using the Exact Search Unit (ESU) on the retrieved results. Given the extensive length of user behavior sequences spanning the entire life cycle, potentially reaching up to 10^6 in scale, there is currently no effective solution for fully modeling such expansive user interests. To overcome this issue, we introduced TWIN-V2, an enhancement of TWIN, where a divide-and-conquer approach is applied to compress life-cycle behaviors and uncover more accurate and diverse user interests. Specifically, a hierarchical clustering method groups items with similar characteristics in life-cycle behaviors into a single cluster during the offline phase. By limiting the size of clusters, we can compress behavior sequences well beyond the magnitude of 10^5 to a length manageable for online inference in GSU retrieval. Cluster-aware target attention extracts comprehensive and multi-faceted long-term interests of users, thereby making the final recommendation results more accurate and diverse. Extensive offline experiments on a multi-billion-scale industrial dataset and online A/B tests have demonstrated the effectiveness of TWIN-V2. Under an efficient deployment framework, TWIN-V2 has been successfully deployed to the primary traffic that serves hundreds of millions of daily active users at Kuaishou.

翻译：在大规模推荐系统中，建模长期用户兴趣对于点击率预测任务的重要性正日益受到研究者和从业者的关注。现有工作，例如SIM和TWIN，通常出于效率考虑采用两阶段方法来建模长期用户行为序列。第一阶段通过基于搜索的机制，即通用搜索单元，从长序列中快速检索出与目标物品相关的子序列；第二阶段则使用精确搜索单元在检索结果上计算兴趣得分。鉴于用户行为序列覆盖整个生命周期，长度极大，规模可能高达10^6，目前尚无有效方案能对此类广泛的用户兴趣进行完整建模。为克服此问题，我们提出了TWIN的增强版本TWIN-V2，其中采用分治法压缩生命周期行为，以挖掘更准确、更多样的用户兴趣。具体而言，在离线阶段，一种层次聚类方法将生命周期行为中具有相似特征的物品分组到单个簇中。通过限制簇的规模，我们可以将行为序列从远超10^5的量级压缩至GSU检索中在线推理可处理的长度。簇感知目标注意力机制提取用户全面且多方面的长期兴趣，从而使最终推荐结果更加准确和多样。在数十亿规模的工业数据集上进行的大量离线实验以及在线A/B测试，均证明了TWIN-V2的有效性。在高效的部署框架下，TWIN-V2已成功部署至快手服务数亿日活跃用户的主流量中。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日