LLaRA: Aligning Large Language Models with Sequential Recommenders

Sequential recommendation aims to predict the subsequent items matching user preference based on her/his historical interactions. With the development of Large Language Models (LLMs), there is growing interest in exploring the potential of LLMs for sequential recommendation by framing it as a language modeling task. Prior works represent items in the textual prompts using either ID indexing or text indexing and feed the prompts into LLMs, but falling short of either encapsulating comprehensive world knowledge or exhibiting sufficient sequential understanding. To harness the complementary strengths of traditional recommenders (which encode user behavioral knowledge) and LLMs (which possess world knowledge about items), we propose LLaRA -- a Large Language and Recommendation Assistant framework. Specifically, LLaRA represents items in LLM's input prompts using a novel hybrid approach that integrates ID-based item embeddings from traditional recommenders with textual item features. Viewing the ``sequential behavior of the user'' as a new modality in recommendation, we employ an adapter to bridge the modality gap between ID embeddings of the traditional recommenders and the input space of LLMs. Furthermore, instead of directly exposing the hybrid prompt to LLMs, we apply a curriculum learning approach to gradually ramp up training complexity. We first warm up the LLM with text-only prompting, which aligns more naturally with the LLM's language modeling capabilities. Thereafter, we progressively transition to hybrid prompting, training the adapter to incorporate behavioral knowledge from the traditional sequential recommender into the LLM. Extensive experiments demonstrate the efficacy of LLaRA framework. Our code and data are available at https://github.com/ljy0ustc/LLaRA .

翻译：序列推荐旨在根据用户的历史交互行为预测其偏好的后续项目。随着大语言模型（LLMs）的发展，将其应用于序列推荐（通过将其构建为语言建模任务）的研究兴趣日益增长。现有工作通过ID索引或文本索引在文本提示中表示项目，并将提示输入LLMs，但要么缺乏对综合世界知识的封装，要么表现出不足的序列理解能力。为发挥传统推荐器（编码用户行为知识）与LLMs（拥有物品世界知识）的互补优势，我们提出LLaRA——一种大语言与推荐助手框架。具体而言，LLaRA采用新颖的混合方法在LLM输入提示中表示项目，该方法将传统推荐器的基于ID的项目嵌入与文本项目特征相结合。将“用户的序列行为”视为推荐中的新模态，我们采用适配器来弥合传统推荐器ID嵌入与LLM输入空间之间的模态差距。此外，我们并未直接将混合提示暴露给LLM，而是应用课程学习方法逐步提升训练复杂度。首先使用纯文本提示对LLM进行热身训练，这更自然地契合LLM的语言建模能力；随后逐步过渡到混合提示训练，使适配器能够将传统序列推荐器的行为知识融入LLM。大量实验证明了LLaRA框架的有效性。我们的代码与数据可在 https://github.com/ljy0ustc/LLaRA 获取。

相关内容

大语言模型

关注 67

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日