Retrieval augmentation is critical when Language Models (LMs) exploit non-parametric knowledge related to the query through external knowledge bases before reasoning. The retrieved information is incorporated into LMs as context alongside the query, enhancing the reliability of responses towards factual questions. Prior researches in retrieval augmentation typically follow a retriever-generator paradigm. In this context, traditional retrievers encounter challenges in precisely and seamlessly extracting query-relevant information from knowledge bases. To address this issue, this paper introduces a novel retrieval augmentation framework called ChatLR that primarily employs the powerful semantic understanding ability of Large Language Models (LLMs) as retrievers to achieve precise and concise information retrieval. Additionally, we construct an LLM-based search and question answering system tailored for the financial domain by fine-tuning LLM on two tasks including Text2API and API-ID recognition. Experimental results demonstrate the effectiveness of ChatLR in addressing user queries, achieving an overall information retrieval accuracy exceeding 98.8\%.
翻译:检索增强对于语言模型(LMs)在推理前通过外部知识库利用与查询相关的非参数知识至关重要。检索到的信息与查询一起作为上下文融入语言模型,提高了对事实性问题响应的可靠性。以往检索增强的研究通常遵循检索器-生成器范式。在此背景下,传统检索器在从知识库中精确且无缝地提取与查询相关信息时面临挑战。为解决该问题,本文提出一种名为ChatLR的新型检索增强框架,该框架主要利用大型语言模型(LLMs)强大的语义理解能力作为检索器,实现精确且简洁的信息检索。此外,我们通过在文本到API(Text2API)和API-ID识别两项任务上微调LLM,构建了一个面向金融领域的基于LLM的搜索与问答系统。实验结果表明ChatLR在解决用户查询方面具有有效性,整体信息检索准确率超过98.8%。