From Real to Cloned Singer Identification

Cloned voices of popular singers sound increasingly realistic and have gained popularity over the past few years. They however pose a threat to the industry due to personality rights concerns. As such, methods to identify the original singer in synthetic voices are needed. In this paper, we investigate how singer identification methods could be used for such a task. We present three embedding models that are trained using a singer-level contrastive learning scheme, where positive pairs consist of segments with vocals from the same singers. These segments can be mixtures for the first model, vocals for the second, and both for the third. We demonstrate that all three models are highly capable of identifying real singers. However, their performance deteriorates when classifying cloned versions of singers in our evaluation set. This is especially true for models that use mixtures as an input. These findings highlight the need to understand the biases that exist within singer identification systems, and how they can influence the identification of voice deepfakes in music.

翻译：近年来，流行歌手的克隆声音日益逼真并广受欢迎。然而，由于人格权方面的担忧，这些克隆声音对音乐产业构成了威胁。因此，需要开发能够识别合成声音中原始歌手的方法。本文研究了如何将歌手识别方法应用于此类任务。我们提出了三种嵌入模型，这些模型采用歌手级别的对比学习方案进行训练，其中正样本对由来自同一歌手的含人声片段构成。对于第一个模型，这些片段可以是混音版本；对于第二个模型，可以是纯人声版本；对于第三个模型，则两者兼有。我们证明所有三个模型在识别真实歌手方面都表现出色。然而，在对评估集中的歌手克隆版本进行分类时，它们的性能出现下降。对于使用混音作为输入的模型，这种现象尤为明显。这些发现凸显了理解歌手识别系统中存在的偏见及其如何影响音乐中声音深度伪造识别的必要性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日