Towards Graph Foundation Models for Personalization

In the realm of personalization, integrating diverse information sources such as consumption signals and content-based representations is becoming increasingly critical to build state-of-the-art solutions. In this regard, two of the biggest trends in research around this subject are Graph Neural Networks (GNNs) and Foundation Models (FMs). While GNNs emerged as a popular solution in industry for powering personalization at scale, FMs have only recently caught attention for their promising performance in personalization tasks like ranking and retrieval. In this paper, we present a graph-based foundation modeling approach tailored to personalization. Central to this approach is a Heterogeneous GNN (HGNN) designed to capture multi-hop content and consumption relationships across a range of recommendable item types. To ensure the generality required from a Foundation Model, we employ a Large Language Model (LLM) text-based featurization of nodes that accommodates all item types, and construct the graph using co-interaction signals, which inherently transcend content specificity. To facilitate practical generalization, we further couple the HGNN with an adaptation mechanism based on a two-tower (2T) architecture, which also operates agnostically to content type. This multi-stage approach ensures high scalability; while the HGNN produces general purpose embeddings, the 2T component models in a continuous space the sheer size of user-item interaction data. Our comprehensive approach has been rigorously tested and proven effective in delivering recommendations across a diverse array of products within a real-world, industrial audio streaming platform.

翻译：在个性化领域，整合消费信号和基于内容的表示等多源信息，日益成为构建先进解决方案的关键。围绕这一主题，当前两大研究趋势分别是图神经网络（GNNs）和基础模型（FMs）。虽然GNNs已作为规模化驱动个性化的流行解决方案在工业界广泛应用，但FMs近期才因其在排序和检索等个性化任务中的出色表现而受到关注。本文提出了一种面向个性化的基于图的基础建模方法。该方法的核心理念是设计一个异构GNN（HGNN），用于捕获跨多种可推荐内容类型的多跳内容与消费关系。为确保基础模型所需的通用性，我们采用基于大型语言模型（LLM）的文本特征化方法对节点进行表征，该方法兼容所有内容类型，并利用天然超越内容特异性的共交互信号构建图结构。为促进实际场景中的泛化能力，我们将HGNN与基于双塔（2T）架构的适应机制相结合，该机制同样对内容类型无感知。这种多阶段方法确保了高度可扩展性：HGNN生成通用嵌入，而2T组件则在连续空间中建模海量用户-物品交互数据。我们对该综合性方法进行了严格测试，并在真实工业音频流平台上的多种产品推荐场景中验证了其有效性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日