Narrative-driven recommendation (NDR) presents an information access problem where users solicit recommendations with verbose descriptions of their preferences and context, for example, travelers soliciting recommendations for points of interest while describing their likes/dislikes and travel circumstances. These requests are increasingly important with the rise of natural language-based conversational interfaces for search and recommendation systems. However, NDR lacks abundant training data for models, and current platforms commonly do not support these requests. Fortunately, classical user-item interaction datasets contain rich textual data, e.g., reviews, which often describe user preferences and context - this may be used to bootstrap training for NDR models. In this work, we explore using large language models (LLMs) for data augmentation to train NDR models. We use LLMs for authoring synthetic narrative queries from user-item interactions with few-shot prompting and train retrieval models for NDR on synthetic queries and user-item interaction data. Our experiments demonstrate that this is an effective strategy for training small-parameter retrieval models that outperform other retrieval and LLM baselines for narrative-driven recommendation.
翻译:叙事驱动推荐(NDR)是一种信息访问问题,用户通过详细描述其偏好和情境来寻求推荐,例如旅行者在描述其喜好/厌恶及旅行情况的同时,寻求兴趣点推荐。随着基于自然语言的对话式搜索与推荐系统的兴起,此类需求日益重要。然而,NDR缺乏充足的模型训练数据,且当前平台通常不支持此类请求。幸运的是,经典的用户-物品交互数据集包含丰富的文本数据(如评论),这些数据常描述用户偏好和情境——这可用于启动NDR模型的训练。本研究探索使用大语言模型(LLMs)进行数据增强以训练NDR模型。我们利用LLMs通过少样本提示从用户-物品交互中生成合成叙事查询,并基于合成查询与用户-物品交互数据训练NDR检索模型。实验表明,这是训练小参数检索模型的有效策略,这些模型在叙事驱动推荐任务上优于其他检索模型和LLM基线方法。