Intent detection is a critical component of task-oriented dialogue systems (TODS), enabling the identification of suitable actions to address user utterances at each dialogue turn. Traditional approaches have relied on computationally efficient supervised sentence-transformer encoder models, which require substantial training data and struggle with out-of-scope (OOS) detection. The emergence of generative large language models (LLMs) with intrinsic world knowledge presents new opportunities to address these challenges. In this work, we adapt seven state-of-the-art (SOTA) LLMs using adaptive in-context learning and chain-of-thought prompting for intent detection, and compare their performance with contrastively fine-tuned sentence-transformer (SetFit) models to highlight the trade-off between prediction quality and latency. We propose a hybrid system that combines the two approaches via an uncertainty-based routing strategy; together with negative data augmentation, it achieves the best of both worlds (i.e., within 2% of native LLM accuracy at 50% lower latency). To better understand LLM OOS-detection capabilities, we perform controlled experiments revealing that this capability is significantly influenced by the scope of the intent labels and the size of the label space. We also introduce a two-step approach that utilizes internal LLM representations, yielding empirical gains of more than 5% in OOS-detection accuracy and F1-score for the Mistral-7B model.
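The uncertainty-based routing idea above can be illustrated with a minimal sketch. This is not the paper's implementation; the threshold value and the use of max softmax probability as the confidence signal are illustrative assumptions. The fast SetFit encoder answers when it is confident, and only uncertain utterances are escalated to the slower LLM, which is how the hybrid system can stay close to native LLM accuracy at roughly half the latency.

```python
import math

def softmax(scores):
    """Convert raw classifier scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route(encoder_scores, threshold=0.7):
    """Uncertainty-based routing (illustrative sketch).

    If the encoder's top softmax probability clears the threshold,
    accept its predicted intent index; otherwise defer to the LLM.
    The threshold 0.7 is a placeholder, not a value from the paper.
    """
    probs = softmax(encoder_scores)
    confidence = max(probs)
    if confidence >= threshold:
        return "encoder", probs.index(confidence)
    return "llm", None

# Confident case: the encoder's prediction is kept.
print(route([5.0, 0.1, 0.2]))   # routed to the encoder, intent 0
# Uncertain case: scores are close, so the utterance goes to the LLM.
print(route([1.0, 0.9, 1.1]))   # routed to the LLM
```

In practice the threshold would be tuned on a validation set to balance the fraction of traffic sent to the LLM (latency) against the accuracy recovered on hard or out-of-scope utterances.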