INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Large language models (LLMs) have demonstrated impressive capabilities in various natural language processing tasks. Despite this, their application to information retrieval (IR) tasks is still challenging due to the infrequent occurrence of many IR-specific concepts in natural language. While prompt-based methods can provide task descriptions to LLMs, they often fall short in facilitating a comprehensive understanding and execution of IR tasks, thereby limiting LLMs' applicability. To address this gap, in this work, we explore the potential of instruction tuning to enhance LLMs' proficiency in IR tasks. We introduce a novel instruction tuning dataset, INTERS, encompassing 20 tasks across three fundamental IR categories: query understanding, document understanding, and query-document relationship understanding. The data are derived from 43 distinct datasets with manually written templates. Our empirical results reveal that INTERS significantly boosts the performance of various publicly available LLMs, such as LLaMA, Mistral, and Phi, in IR tasks. Furthermore, we conduct extensive experiments to analyze the effects of instruction design, template diversity, few-shot demonstrations, and the volume of instructions on performance. We make our dataset and the fine-tuned models publicly accessible at https://github.com/DaoD/INTERS.
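As a rough illustration of what "manually written templates" could mean in practice, the sketch below turns one labeled query-document pair into an instruction-tuning example. The templates, field names, and the `build_example` helper are all hypothetical, written only for this illustration; they are not taken from the INTERS dataset or its code.

```python
import random

# Hypothetical templates for one task (query-document relevance judgment).
# Illustrative only; these are NOT the templates used in INTERS.
TEMPLATES = [
    "Query: {query}\nDocument: {document}\n"
    "Is the document relevant to the query? Answer yes or no.",
    "Given the query \"{query}\", decide whether the passage below answers it.\n"
    "Passage: {document}\nAnswer (yes/no):",
]


def build_example(record, templates, rng):
    """Fill a randomly chosen template with one labeled record.

    `record` is assumed to hold "query", "document", and a binary "label";
    this schema is an assumption made for the sketch.
    """
    template = rng.choice(templates)
    prompt = template.format(query=record["query"], document=record["document"])
    completion = "yes" if record["label"] == 1 else "no"
    return {"prompt": prompt, "completion": completion}


record = {
    "query": "what is instruction tuning",
    "document": "Instruction tuning fine-tunes a model on tasks phrased as instructions.",
    "label": 1,
}
example = build_example(record, TEMPLATES, random.Random(0))
print(example["prompt"])
print(example["completion"])
```

Sampling among several templates per task is one simple way to vary the wording of instructions, which relates to the template diversity factor the abstract mentions analyzing.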

