Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition

Speaker identification systems are deployed in diverse environments that often differ from the lab conditions under which they are trained and tested. In this paper, we first show that fixed thresholds (computed using the EER metric) generalize poorly for imposter identification in unseen speaker recognition, and we introduce a robust speaker-specific thresholding technique that performs better. Second, inspired by the recent use of meta-learning techniques in speaker verification, we propose an end-to-end meta-learning framework for imposter detection that decouples imposter detection from unseen speaker identification. Unlike most prior work, which relies on heuristics to detect imposters, the proposed network learns to detect imposters by leveraging the utterances of the enrolled speakers. We demonstrate the efficacy of the proposed techniques on the VoxCeleb1, VCTK, and FFSVC 2022 datasets, beating the baselines by up to 10%.
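To make the contrast between a fixed threshold and a speaker-specific one concrete, here is a minimal sketch in Python. It is not the paper's method: the abstract does not give the thresholding formula, so `speaker_threshold` uses an assumed rule (mean minus `k` standard deviations of a speaker's own enrollment similarities), and the function names, the parameter `k`, and the cosine scoring are all illustrative assumptions. `eer_threshold` shows the standard idea of choosing one global operating point where the false acceptance rate (FAR) and false rejection rate (FRR) are closest.

```python
import math
from statistics import mean, pstdev


def cosine(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def eer_threshold(genuine, imposter):
    """Return (threshold, far, frr) at the point where |FAR - FRR| is smallest.

    Candidate thresholds are the observed scores; a trial is accepted
    when its score is >= the threshold.
    """
    best = None
    for t in sorted(set(genuine) | set(imposter)):
        far = sum(s >= t for s in imposter) / len(imposter)
        frr = sum(s < t for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, t, far, frr)
    return best[1], best[2], best[3]


def speaker_threshold(enrolled, k=2.0):
    """Hypothetical per-speaker threshold (an assumption, not the paper's rule).

    Uses mean - k * std of the pairwise similarities among one speaker's
    enrollment embeddings; needs at least two enrollment embeddings.
    """
    sims = [cosine(enrolled[i], enrolled[j])
            for i in range(len(enrolled))
            for j in range(i + 1, len(enrolled))]
    return mean(sims) - k * pstdev(sims)
```

In this sketch, a single value from `eer_threshold` would be applied to every enrolled speaker, whereas `speaker_threshold` is computed once per speaker from that speaker's enrollment utterances, so a speaker whose utterances vary more gets a looser cutoff.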

