Recommender systems that rely on Language Models (LMs) have gained popularity for helping users navigate large catalogs. LMs often exploit high-level item descriptors, such as categories or consumption contexts, drawn from training data or user preferences. This approach has proven effective in domains like movies and products. In the music domain, however, little is known about how effectively LMs use song descriptors for natural language-based music recommendation. In this paper, we assess how effectively LMs recommend songs given users' natural language descriptions and items annotated with descriptors such as genres, moods, and listening contexts. We formulate the recommendation task as a dense retrieval problem and assess LMs as they become increasingly familiar with data pertinent to the task and domain. Our findings reveal improved performance as LMs are fine-tuned for general language similarity, for information retrieval, and for mapping longer descriptions to shorter, high-level descriptors in music.
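The dense retrieval formulation can be illustrated with a minimal sketch: the user's description and each song's descriptors are encoded into the same vector space, and songs are ranked by cosine similarity to the query. Everything below is an assumption made for illustration, not the setup evaluated in the paper: the catalog and the names `CATALOG`, `embed`, `cosine`, and `recommend` are hypothetical, and `embed` is a toy descriptor-count vector standing in for an LM encoder.

```python
import math
import re

# Hypothetical toy catalog: song title -> high-level descriptors
# (genre, mood, listening context). Not data from the paper.
CATALOG = {
    "Song A": ["rock", "energetic", "workout"],
    "Song B": ["ambient", "calm", "study"],
    "Song C": ["jazz", "relaxed", "dinner"],
}

# One vector dimension per descriptor seen in the catalog.
VOCAB = sorted({tag for tags in CATALOG.values() for tag in tags})


def embed(text):
    """Toy stand-in for an LM encoder: counts of known descriptors in the text."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [float(tokens.count(term)) for term in VOCAB]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def recommend(query, k=2):
    """Return the k song titles whose descriptor vectors are closest to the query."""
    q = embed(query)
    scored = [
        (cosine(q, embed(" ".join(tags))), title)
        for title, tags in CATALOG.items()
    ]
    # Highest similarity first; ties are broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for _, title in scored[:k]]


if __name__ == "__main__":
    print(recommend("something calm to play while I study", k=1))
```

In a realistic system, `embed` would be replaced by a pretrained or fine-tuned LM encoder, which is what allows a long free-form description to match short descriptors it shares no words with; the ranking step would stay the same.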