Distilling conversational skills into Small Language Models (SLMs) with approximately 1 billion parameters presents significant challenges. First, compared to larger models, SLMs have limited capacity in their parameters to store extensive knowledge. Second, high-quality conversational datasets are often scarce, small, and domain-specific. To address these challenges, we introduce a novel data distillation framework named CoDi (short for Conversational Distillation, pronounced "Cody"), which allows us to synthesize large-scale, assistant-style datasets in a steerable and diverse manner. While our framework is task-agnostic at its core, we explore and evaluate the potential of CoDi on the task of conversational grounded reasoning for question answering. This is a typical on-device scenario for specialist SLMs, enabling open-domain model responses without requiring the model to "memorize" world knowledge in its limited weights. Our evaluations show that SLMs trained with CoDi-synthesized data achieve performance on standard metrics comparable to that of models trained on human-annotated data. Additionally, when using our framework to generate larger datasets from web data, our models surpass larger, instruction-tuned models on zero-shot conversational grounded reasoning tasks.