Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory, often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in extensive systems like Meta's, which encompass hundreds of models with diverse specifications, rendering the tailoring of user representation learning for each model impractical. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system, designed to facilitate efficient and scalable sharing of online user representation across hundreds of ads models. SUM leverages a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, promoting efficient representation sharing. To adapt to the dynamic nature of user features and ensure embedding freshness, we designed SUM Online Asynchronous Platform (SOAP), a latency free online serving system complemented with model freshness and embedding stabilization, which enables frequent user model updates and online inference of user embeddings upon each user request. We share our hands-on deployment experiences for the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched to hundreds of ads ranking models in Meta, processing hundreds of billions of user requests daily, yielding significant online metric gains and improved infrastructure efficiency.
翻译:有效的用户表征在个性化广告中至关重要。然而,训练吞吐量、服务延迟和内存方面的严格约束,常常限制了在线广告排序模型的复杂性和输入特征集。这一挑战在Meta等大型系统中尤为突出,此类系统包含数百个具有不同规格的模型,使得为每个模型定制用户表征学习变得不切实际。为应对这些挑战,我们提出了规模化用户建模(SUM)框架,该框架已在Meta广告排序系统中广泛部署,旨在促进数百个广告模型之间高效、可扩展的在线用户表征共享。SUM利用少数指定的上游用户模型,通过先进的建模技术从海量用户特征中合成用户嵌入向量。这些嵌入向量随后作为下游在线广告排序模型的输入,促进了高效的表征共享。为适应用户特征的动态特性并确保嵌入的新鲜度,我们设计了SUM在线异步平台(SOAP),这是一个无延迟的在线服务系统,辅以模型新鲜度和嵌入稳定性机制,支持频繁的用户模型更新以及在每次用户请求时进行在线用户嵌入推理。我们分享了SUM框架的实际部署经验,并通过全面的实验验证了其优越性。截至目前,SUM已在Meta数百个广告排序模型中上线,每日处理数千亿用户请求,取得了显著的在线指标提升和基础设施效率改进。