In this paper we present a novel approach for environment classification for speech recordings, which does not require the selection of decaying reverberation tails. It is based on a multi-band RT60 analysis of blind channel estimates and achieves an accuracy of up to 93.6% on test recordings derived from the ACE corpus.
翻译:本文提出一种针对语音录制的环境分类新方法,无需选择衰减混响尾音。该方法基于对盲信道估计的多频带RT60分析,在源自ACE语料库的测试录音上实现了高达93.6%的分类准确率。