Trigger$^3$: Refining Query Correction via Adaptive Model Selector

In search scenarios, user experience can be hindered by erroneous queries due to typos, voice errors, or knowledge gaps. Therefore, query correction is crucial for search engines. Current correction models, usually small models trained on specific data, often struggle with queries beyond their training scope or those requiring contextual understanding. While the advent of Large Language Models (LLMs) offers a potential solution, they are still limited by their pre-training data and inference cost, particularly for complex queries, making them not always effective for query correction. To tackle these, we propose Trigger$^3$, a large-small model collaboration framework that integrates the traditional correction model and LLM for query correction, capable of adaptively choosing the appropriate correction method based on the query and the correction results from the traditional correction model and LLM. Trigger$^3$ first employs a correction trigger to filter out correct queries. Incorrect queries are then corrected by the traditional correction model. If this fails, an LLM trigger is activated to call the LLM for correction. Finally, for queries that no model can correct, a fallback trigger decides to return the original query. Extensive experiments demonstrate Trigger$^3$ outperforms correction baselines while maintaining efficiency.

翻译：在搜索场景中，因拼写错误、语音识别错误或知识差距导致的错误查询会损害用户体验。因此，查询纠错对搜索引擎至关重要。当前的纠错模型通常是基于特定数据训练的小型模型，往往难以处理超出其训练范围的查询或需要上下文理解的查询。虽然大型语言模型的出现提供了一种潜在的解决方案，但它们仍受限于预训练数据和推理成本，尤其对于复杂查询，使其在查询纠错中并非总是有效。为解决这些问题，我们提出了Trigger$^3$，一个大小模型协作框架，它整合了传统纠错模型与大型语言模型进行查询纠错，能够根据查询以及传统纠错模型和大型语言模型的纠错结果，自适应地选择合适的纠错方法。Trigger$^3$首先使用一个纠错触发器过滤掉正确的查询。错误的查询随后由传统纠错模型进行纠正。若此步骤失败，则激活一个大型语言模型触发器以调用大型语言模型进行纠错。最后，对于任何模型都无法纠正的查询，一个回退触发器决定返回原始查询。大量实验表明，Trigger$^3$在保持效率的同时，其性能优于现有的纠错基线方法。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日