Fine-tuning generic ASR models with large-scale synthetic personal data can enhance personalization, but it introduces two challenges: adapting to synthetic personal data without forgetting knowledge learned from real data, and adapting to personal data without forgetting generic knowledge. Since the functionally invariant path (FIP) framework enables model adaptation while preserving prior knowledge, in this letter we introduce FIP into synthetic-data-augmented personalized ASR. However, when FIP is applied to train the model on all three types of data simultaneously, the model still struggles to balance the learning of synthetic, personalized, and generic knowledge. To decouple this learning process and address the two challenges above, we integrate a gated parameter-isolation strategy into FIP and propose the knowledge-decoupled functionally invariant path (KDFIP) framework, which stores generic and personalized knowledge in separate modules and applies FIP to them sequentially. Specifically, KDFIP adapts the personalized module to synthetic and real personal data and the generic module to generic data. Both modules are updated along personalization-invariant paths, and their outputs are dynamically fused through a gating mechanism. With augmented synthetic data, KDFIP achieves a 29.38% relative character-error-rate reduction on target speakers while maintaining generalization performance comparable to that of the unadapted ASR baseline.