Generative recommendation (GR) has emerged as a promising direction for recommender systems. Recently, large language models (LLMs) have been increasingly adopted for GR, as their rich pretrained knowledge is expected to help them generalize beyond common user behavior patterns that traditional memorization-oriented baselines can capture. However, existing LLM-based GR works largely ignore LLMs' well-known tendency to memorize, which, if present in LLMs fine-tuned for GR, would restrict their utilization of pretrained knowledge. In this work, we investigate this concern by examining one-hop memorization, where a model recommends items that are direct successors of items in the training data. We show that LLMs do this more than non-LLM-based GR models-in fact, the vast majority of their gains over GR baselines are actually on users whose target items can be predicted through one-hop memorization. We intuit that improving performance on the remaining users requires LLMs to learn richer item-item relations beyond one-hop transitions. To achieve this, we propose IIRG, a novel training strategy that teaches LLMs to capture: (1) collaborative relations derived from item co-occurrences across multiple hops in user sequences, and (2) semantic relations among items with similar themes, both of which can serve as useful recommendation signals. We show that IIRG significantly improves over LLMs trained solely with standard next-item prediction, with especially large gains for users whose test items are not covered by train-time one-hop transitions.
翻译:生成式推荐(GR)已成为推荐系统的一个有前景的研究方向。近期,大型语言模型(LLMs)被越来越多地应用于GR,其丰富的预训练知识有望帮助模型泛化超越传统以记忆为导向的基线方法所能捕捉的常见用户行为模式。然而,现有的基于LLM的GR研究大多忽视了LLM众所周知的记忆倾向——若这种倾向存在于为GR微调的LLM中,将限制其预训练知识的利用。本研究通过考察"一跳记忆"(即模型推荐训练数据中项目直接后继项的行为)来探究该问题。我们发现,LLM比非LLM的GR模型表现出更强的此类记忆行为——事实上,相较于GR基线方法的性能提升,绝大部分增益实际上来源于那些可以通过一跳记忆预测目标项的用户。我们推断,要提升其余用户的推荐效果,需要LLM学习超越一跳迁移的丰富项目间关系。为此,我们提出IIRG这一新型训练策略,教导LLM捕获两类有用的推荐信号:(1)基于用户序列中多跳共现模式导出的协同关系;(2)主题相似项目间的语义关系。实验表明,相较于仅使用标准下一项预测训练的LLM,IIRG能显著提升性能,尤其对测试项目未在训练时的一跳迁移中出现的用户,增益更为突出。