An initial real-time speech enhancement method is presented to reduce the effects of additive noise. The method operates in the frequency domain and is a form of spectral subtraction. Initially, minimum statistics are used to generate an estimate of the noise signal in the frequency domain. The use of minimum statistics avoids the need for a voice activity detector (VAD) which has proven to be challenging to create. As minimum statistics are used, the noise signal estimate must be multiplied by a scaling factor before subtraction from the noise corrupted speech signal can take place. A spectral floor is applied to the difference to suppress the effects of "musical noise". Finally, a series of further enhancements are considered to reduce the effects of residual noise even further. These methods are compared using time-frequency plots to create the final speech enhancement design
翻译:提出了一种初始的实时语音增强方法,用于降低加性噪声的影响。该方法在频域中运行,属于谱减法的一种形式。首先,利用最小统计在频域中生成噪声信号的估计。最小统计的使用避免了对语音活动检测器(VAD)的需求,而VAD的构建已被证明具有挑战性。由于采用了最小统计,噪声信号估计在从带噪语音信号中减去之前必须乘以一个缩放因子。在差值上施加谱底,以抑制“音乐噪声”的影响。最后,考虑了一系列进一步增强措施,以进一步降低残留噪声的影响。通过使用时频图比较这些方法,以构建最终的语音增强设计方案。