Large Language Models (LLMs) for Recommendation (LLM4Rec) is a promising research direction that has demonstrated exceptional performance in recommender systems. However, its inability to capture real-time user preferences greatly limits the practical application of LLM4Rec, because (i) frequent training and inference with LLMs are costly, and (ii) LLMs struggle to access real-time data, since their large parameter counts hinder deployment on devices. Fortunately, small recommendation models (SRMs) can effectively complement the LLM4Rec paradigm: they consume minimal resources for frequent training and inference, and they can conveniently access real-time data on devices. In light of this, we design the Device-Cloud LLM-SRM Collaborative Recommendation Framework (LSC4Rec) under a device-cloud collaboration setting. LSC4Rec aims to integrate the advantages of both LLMs and SRMs, as well as the benefits of cloud and edge computing, achieving complementary synergy. We enhance the practicability of LSC4Rec by designing three strategies: collaborative training, collaborative inference, and intelligent request. During training, the LLM generates candidate lists to enhance the ranking ability of the SRM in collaborative scenarios, and the SRM is updated adaptively to capture real-time user interests. During inference, the LLM and the SRM are deployed on the cloud and on the device, respectively. The LLM generates candidate lists and initial ranking results based on user behavior, the SRM reranks the items in the candidate list, and the final results integrate the scores of both models. The device determines whether a new candidate list is needed by comparing the consistency of the LLM's and the SRM's ranked lists. Our comprehensive and extensive experimental analysis validates the effectiveness of each strategy in LSC4Rec.
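The collaborative inference and intelligent-request steps described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual implementation: the function names, the score-fusion weight `alpha`, and the consistency `threshold` are all assumptions introduced here for exposition.

```python
# Hypothetical sketch of LSC4Rec's device-side collaborative inference:
# fuse cloud-side LLM scores with on-device SRM scores, and decide from
# ranking (in)consistency whether to request a fresh candidate list.

def fuse_scores(llm_scores, srm_scores, alpha=0.5):
    """Blend per-item LLM and SRM scores (alpha is an assumed weight)."""
    return {item: alpha * llm_scores[item] + (1 - alpha) * srm_scores[item]
            for item in llm_scores}

def ranking_consistency(rank_a, rank_b):
    """Kendall-tau-style agreement in [-1, 1] between two rankings
    over the same item set."""
    pos_b = {item: i for i, item in enumerate(rank_b)}
    concordant = discordant = 0
    for i in range(len(rank_a)):
        for j in range(i + 1, len(rank_a)):
            if pos_b[rank_a[i]] < pos_b[rank_a[j]]:
                concordant += 1
            else:
                discordant += 1
    total = concordant + discordant
    return (concordant - discordant) / total if total else 1.0

def should_request_new_candidates(llm_scores, srm_scores, threshold=0.3):
    """Device-side rule: request a new candidate list from the cloud LLM
    when the two models' rankings disagree too strongly."""
    rank_llm = sorted(llm_scores, key=llm_scores.get, reverse=True)
    rank_srm = sorted(srm_scores, key=srm_scores.get, reverse=True)
    return ranking_consistency(rank_llm, rank_srm) < threshold
```

Under this sketch, agreeing rankings keep serving fused scores locally, while strong disagreement (e.g., the SRM detecting a real-time interest shift the LLM's stale candidates miss) triggers a request to the cloud.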