Large language models (LLMs) have demonstrated impressive capabilities in various natural language processing tasks. Despite this, their application to information retrieval (IR) tasks remains challenging because many IR-specific concepts occur infrequently in natural language. While prompt-based methods can provide task descriptions to LLMs, they often fall short in facilitating a comprehensive understanding and execution of IR tasks, thereby limiting the applicability of LLMs. To address this gap, we explore the potential of instruction tuning to enhance LLMs' proficiency in IR tasks. We introduce a novel instruction tuning dataset, INTERS, encompassing 20 tasks across three fundamental IR categories: query understanding, document understanding, and query-document relationship understanding. The data are derived from 43 distinct datasets with manually written templates. Our empirical results reveal that INTERS significantly boosts the performance of various publicly available LLMs, such as LLaMA, Mistral, and Phi, on IR tasks. Furthermore, we conduct extensive experiments to analyze the effects of instruction design, template diversity, few-shot demonstrations, and the volume of instructions on performance. We make our dataset and the fine-tuned models publicly accessible at https://github.com/DaoD/INTERS.