Neural network pruning offers an effective method for compressing a multilingual automatic speech recognition (ASR) model with minimal performance loss. However, it entails several rounds of pruning and re-training needed to be run for each language. In this work, we propose the use of an adaptive masking approach in two scenarios for pruning a multilingual ASR model efficiently, each resulting in sparse monolingual models or a sparse multilingual model (named as Dynamic ASR Pathways). Our approach dynamically adapts the sub-network, avoiding premature decisions about a fixed sub-network structure. We show that our approach outperforms existing pruning methods when targeting sparse monolingual models. Further, we illustrate that Dynamic ASR Pathways jointly discovers and trains better sub-networks (pathways) of a single multilingual model by adapting from different sub-network initializations, thereby reducing the need for language-specific pruning.
翻译:神经网络剪枝提供了一种有效压缩多语言自动语音识别(ASR)模型的方法,且性能损失极小。然而,传统剪枝需要针对每种语言进行多轮剪枝和重新训练。针对这一问题,本文提出在两种场景下使用自适应掩码方法对多语言ASR模型进行高效剪枝,分别生成稀疏单语言模型或稀疏多语言模型(命名为动态ASR路径)。该方法能够动态调整子网络,避免对固定子网络结构做出过早决策。实验表明,在生成稀疏单语言模型时,所提方法优于现有剪枝技术。进一步,我们证明动态ASR路径可以通过适应不同子网络初始化方式,在单个多语言模型中联合发现并训练更优的子网络(路径),从而减少对语言特定剪枝的需求。