This technical report describes ChinaTelecom system for Track 1 (closed) of the VoxCeleb2023 Speaker Recognition Challenge (VoxSRC 2023). Our system consists of several ResNet variants trained only on VoxCeleb2, which were fused for better performance later. Score calibration was also applied for each variant and the fused system. The final submission achieved minDCF of 0.1066 and EER of 1.980%.
翻译:本技术报告描述了中国电信在VoxCeleb 2023说话人识别挑战赛(VoxSRC 2023)Track 1(封闭赛道)中的系统方案。我们的系统由多个仅在VoxCeleb2上训练的ResNet变体组成,后续通过融合策略进一步提升性能。针对每个变体及融合系统均进行了分数校准。最终提交结果实现了minDCF为0.1066,EER为1.980%。