Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs

The integration of large language models (LLMs) and search engines represents a significant evolution in knowledge acquisition methodologies. However, determining the knowledge that an LLM already possesses and the knowledge that requires the help of a search engine remains an unresolved issue. Most existing methods solve this problem through the results of preliminary answers or reasoning done by the LLM itself, but this incurs excessively high computational costs. This paper introduces a novel collaborative approach, namely SlimPLM, that detects missing knowledge in LLMs with a slim proxy model, to enhance the LLM's knowledge acquisition process. We employ a proxy model which has far fewer parameters, and take its answers as heuristic answers. Heuristic answers are then utilized to predict the knowledge required to answer the user question, as well as the known and unknown knowledge within the LLM. We only conduct retrieval for the missing knowledge in questions that the LLM does not know. Extensive experimental results on five datasets with two LLMs demonstrate a notable improvement in the end-to-end performance of LLMs in question-answering tasks, achieving or surpassing current state-of-the-art models with lower LLM inference costs.

翻译：大语言模型（LLMs）与搜索引擎的融合代表了知识获取方法论的重大演进。然而，确定LLM已具备的知识以及需要借助搜索引擎获取的知识仍是一个未解决的问题。现有方法大多通过LLM自身生成的初步答案或推理结果来解决此问题，但这会导致过高的计算成本。本文提出了一种新颖的协作方法，即SlimPLM，通过精简代理模型检测LLM中缺失的知识，从而增强LLM的知识获取过程。我们采用一个参数规模远小于LLM的代理模型，并将其回答视为启发式答案。随后，利用这些启发式答案来预测回答用户问题所需的知识，以及LLM中已知和未知的知识。我们仅对LLM未知的问题中缺失的知识进行检索。在五个数据集上使用两个LLM进行的大量实验结果表明，该方法显著提升了LLM在问答任务中的端到端性能，以更低的LLM推理成本达到或超越了当前最先进的模型。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日