Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Wei Zhang, Dai Li, Chen Liang, Fang Zhou, Zhongke Zhang, Xuewei Wang, Ru Li, Yi Zhou, Yaning Huang, Dong Liang, Kai Wang, Zhangyuan Wang, Zhengxing Chen, Min Li, Fenggang Wu, Minghai Chen, Huayu Li, Yunnan Wu, Zhan Shu, Mindi Yuan, Sri Reddy

From arXiv, 8 pages, 3 figures

Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in large systems like Meta's, which encompass hundreds of models with diverse specifications, making it impractical to tailor user representation learning to each model. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system and designed to enable efficient, scalable sharing of online user representations across hundreds of ads models. SUM uses a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, so that many models share one learned representation. To adapt to the dynamic nature of user features and keep embeddings fresh, we designed the SUM Online Asynchronous Platform (SOAP), a latency-free online serving system complemented by model freshness and embedding stabilization, which enables frequent user model updates and online inference of user embeddings upon each user request. We share our hands-on deployment experience with the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched in hundreds of ads ranking models at Meta, processing hundreds of billions of user requests daily and yielding significant online metric gains and infrastructure cost savings.
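To make the upstream/downstream split concrete, here is a minimal Python sketch of the pattern the abstract describes: one upstream user model produces an embedding, a serving layer refreshes it asynchronously on each request, and several downstream models consume the same embedding. It is an illustration under stated assumptions, not the SUM or SOAP implementation. Every name in it (`upstream_user_model`, `AsyncEmbeddingService`, `downstream_score`, `EMBED_DIM`) is hypothetical, the "model" is a toy feature fold, and the read-stored-value-then-refresh behavior is one assumed reading of "asynchronous" and "latency-free". Model freshness and embedding stabilization are not modeled.

```python
import math
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Dict, List, Tuple

EMBED_DIM = 4  # hypothetical embedding size, chosen only for the example


def upstream_user_model(user_features: List[float]) -> List[float]:
    """Toy stand-in for an upstream user model: folds features into EMBED_DIM sums."""
    emb = [0.0] * EMBED_DIM
    for i, x in enumerate(user_features):
        emb[i % EMBED_DIM] += x
    return emb


class AsyncEmbeddingService:
    """Hypothetical serving layer: reads never wait on upstream inference."""

    def __init__(self) -> None:
        self._store: Dict[str, List[float]] = {}
        self._pool = ThreadPoolExecutor(max_workers=2)

    def _refresh(self, user_id: str, user_features: List[float]) -> None:
        # Runs off the request path and overwrites the stored embedding.
        self._store[user_id] = upstream_user_model(user_features)

    def on_request(
        self, user_id: str, user_features: List[float]
    ) -> Tuple[List[float], Future]:
        # Serve whatever is already stored (zeros for an unseen user) ...
        current = self._store.get(user_id, [0.0] * EMBED_DIM)
        # ... then schedule a refresh so the next request sees a fresher value.
        pending = self._pool.submit(self._refresh, user_id, user_features)
        return current, pending


def downstream_score(user_emb: List[float], ad_weights: List[float]) -> float:
    """Toy downstream ranking model: sigmoid of a dot product with the shared embedding."""
    z = sum(u * w for u, w in zip(user_emb, ad_weights))
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    service = AsyncEmbeddingService()
    features = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
    emb, pending = service.on_request("user_1", features)
    pending.result()  # wait only so the demo is deterministic
    emb, pending = service.on_request("user_1", features)
    pending.result()
    # Two hypothetical downstream models share the same user embedding.
    print(downstream_score(emb, [0.01, 0.0, 0.0, 0.0]))
    print(downstream_score(emb, [0.0, 0.0, 0.0, -0.01]))
```

The design point the sketch tries to capture is that the expensive user model runs once per user request rather than once per downstream model, and that downstream scoring never blocks on it. The cost in this simplified version is that a downstream model may see an embedding computed from the previous request.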

