To promote speech processing and recognition research in driving scenarios, we build on the success of the Intelligent Cockpit Speech Recognition Challenge (ICSRC) held at ISCSLP 2022 and launch the ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge. This challenge collects over 100 hours of multi-channel speech data recorded inside a new energy vehicle and 40 hours of noise for data augmentation. Two tracks, including automatic speech recognition (ASR) and automatic speech diarization and recognition (ASDR) are set up, using character error rate (CER) and concatenated minimum permutation character error rate (cpCER) as evaluation metrics, respectively. Overall, the ICMC-ASR Challenge attracts 98 participating teams and receives 53 valid results in both tracks. In the end, first-place team USTCiflytek achieves a CER of 13.16% in the ASR track and a cpCER of 21.48% in the ASDR track, showing an absolute improvement of 13.08% and 51.4% compared to our challenge baseline, respectively.
翻译:为推动驾驶场景下的语音处理与识别研究,我们在ISCSLP 2022智能座舱语音识别挑战赛(ICSRC)成功举办的基础上,发起ICASSP 2024车载多通道自动语音识别挑战赛(ICMC-ASR)。本挑战赛采集了新能源汽车内超过100小时的多通道语音数据,并额外收录40小时噪声用于数据增强。赛事设置自动语音识别(ASR)与自动语音说话人日志及识别(ASDR)两个赛道,分别采用字符错误率(CER)和串联最小排列字符错误率(cpCER)作为评估指标。本届ICMC-ASR挑战赛共吸引98支参赛团队,双赛道共收到53项有效结果。最终,USTCiflytek团队以ASR赛道13.16%的CER和ASDR赛道21.48%的cpCER夺得双料冠军,相较于赛事基线分别实现13.08%和51.4%的绝对提升。