Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Wei Zhang, Dai Li, Chen Liang, Fang Zhou, Zhongke Zhang, Xuewei Wang, Ru Li, Yi Zhou, Yaning Huang, Dong Liang, Kai Wang, Zhangyuan Wang, Zhengxing Chen, Fenggang Wu, Minghai Chen, Huayu Li, Yunnan Wu, Zhan Shu, Mindi Yuan, Sri Reddy

From arXiv, 8 pages, 3 figures

Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in large systems like Meta's, which encompass hundreds of models with diverse specifications, making it impractical to tailor user representation learning to each model. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system and designed to enable efficient, scalable sharing of online user representations across hundreds of ads models. SUM uses a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, so the representation is learned once and shared. To adapt to the dynamic nature of user features and keep embeddings fresh, we designed the SUM Online Asynchronous Platform (SOAP), a latency-free online serving system complemented by model freshness and embedding stabilization mechanisms, which enables frequent user model updates and online inference of user embeddings on each user request. We share our hands-on deployment experience with the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched in hundreds of ads ranking models at Meta, processing hundreds of billions of user requests daily and yielding significant online metric gains and improved infrastructure efficiency.
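The serving pattern described in the abstract can be illustrated with a minimal sketch. It is not Meta's implementation: the abstract does not say how SOAP achieves latency-free serving or how embeddings are stabilized, so the sketch assumes that each request reads the previously cached embedding immediately while a fresh one is computed off the critical path, and that stabilization is a simple blend of the cached and fresh embeddings. All names (`EmbeddingStore`, `upstream_user_model`, `stabilize`, `refresh`, `handle_request`) and the `alpha` parameter are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


class EmbeddingStore:
    """Hypothetical in-memory key-value store holding the latest user embeddings."""

    def __init__(self, dim):
        self.dim = dim
        self._data = {}

    def read(self, user_id):
        # Users without a cached embedding fall back to a zero vector.
        return self._data.get(user_id, [0.0] * self.dim)

    def write(self, user_id, embedding):
        self._data[user_id] = embedding


def upstream_user_model(features):
    """Toy stand-in for an upstream user model: features -> 2-d embedding."""
    return [sum(features), float(len(features))]


def stabilize(previous, fresh, alpha=0.5):
    """Blend the fresh embedding with the cached one (assumed smoothing scheme)."""
    return [alpha * f + (1 - alpha) * p for p, f in zip(previous, fresh)]


def refresh(store, user_id, features):
    """Run upstream inference and write the stabilized embedding back."""
    fresh = upstream_user_model(features)
    store.write(user_id, stabilize(store.read(user_id), fresh))


def handle_request(store, executor, user_id, features):
    """Serve the cached embedding now; refresh it asynchronously."""
    cached = store.read(user_id)  # what downstream ranking models would consume
    future = executor.submit(refresh, store, user_id, features)  # off the critical path
    return cached, future
```

Under these assumptions, a downstream model always sees the embedding written by an earlier request, so upstream inference adds no latency to the current request; the cost is that the embedding lags by one request.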

