Uncovering ChatGPT's Capabilities in Recommender Systems

The debut of ChatGPT has recently attracted the attention of the natural language processing (NLP) community and beyond. Existing studies have demonstrated that ChatGPT shows significant improvement in a range of downstream NLP tasks, but the capabilities and limitations of ChatGPT in terms of recommendations remain unclear. In this study, we aim to conduct an empirical analysis of ChatGPT's recommendation ability from an Information Retrieval (IR) perspective, including point-wise, pair-wise, and list-wise ranking. To achieve this goal, we re-formulate the above three recommendation policies into a domain-specific prompt format. Through extensive experiments on four datasets from different domains, we demonstrate that ChatGPT outperforms other large language models across all three ranking policies. Based on the analysis of unit cost improvements, we identify that ChatGPT with list-wise ranking achieves the best trade-off between cost and performance compared to point-wise and pair-wise ranking. Moreover, ChatGPT shows the potential for mitigating the cold start problem and explainable recommendation. To facilitate further explorations in this area, the full code and detailed original results are open-sourced at https://github.com/rainym00d/LLM4RS.

翻译：ChatGPT的亮相近期引起了自然语言处理（NLP）领域及之外的广泛关注。现有研究表明，ChatGPT在一系列下游NLP任务中表现出显著提升，但其在推荐方面的能力与局限性仍不清楚。本研究旨在从信息检索（IR）视角对ChatGPT的推荐能力进行实证分析，涵盖逐点排序、成对排序和列表排序。为实现这一目标，我们将上述三种推荐策略重新表述为领域特定的提示格式。通过在四个不同领域数据集上的广泛实验，我们证明ChatGPT在所有三种排序策略上均优于其他大型语言模型。基于单位成本改进的分析，我们确定了与逐点排序和成对排序相比，采用列表排序的ChatGPT在成本与性能之间实现了最佳权衡。此外，ChatGPT展现出缓解冷启动问题及进行可解释推荐的潜力。为促进该领域的进一步探索，完整代码和详细原始结果已在 https://github.com/rainym00d/LLM4RS 开源。

相关内容

ChatGPT

关注 258

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日