Packet loss concealment (PLC) is a tool for enhancing speech degradation caused by poor network conditions or underflow/overflow in audio processing pipelines. We propose a real-time recurrent method that leverages previous outputs to mitigate artefact of lost packets without the prior knowledge of loss mask. The proposed full-band recurrent network (FRN) model operates at 48 kHz, which is suitable for high-quality telecommunication applications. Experiment results highlight the superiority of FRN over an offline non-causal baseline and a top performer in a recent PLC challenge.
翻译:丢包隐藏(PLC)是一种用于改善因网络条件不佳或音频处理管道中下溢/上溢导致的语音质量下降的工具。我们提出一种实时递归方法,该方法利用先前输出减轻丢失包带来的伪影,且无需了解丢包掩码的先验知识。所提出的全频带递归网络(FRN)模型在48 kHz采样率下运行,适用于高品质电信应用。实验结果表明,FRN在性能上优于离线非因果基线模型及近期PLC挑战赛中的最优方法。