AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

April 25, 2023


Chengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu

From arXiv. arXiv admin note: substantial text overlap with arXiv:2302.09198

Advancements in AI-synthesized human voices have created a growing threat of impersonation and disinformation, making it crucial to develop methods to detect synthetic human voices. This study proposes a new approach to identifying synthetic human voices by detecting artifacts of vocoders in audio signals. Most DeepFake audio synthesis models use a neural vocoder, a neural network that generates waveforms from time-frequency representations such as mel-spectrograms. By identifying neural vocoder processing in audio, we can determine whether a sample is synthesized. To detect synthetic human voices, we introduce a multi-task learning framework in which a binary-class RawNet2 model shares its feature extractor with a vocoder identification module. By treating vocoder identification as a pretext task, we constrain the feature extractor to focus on vocoder artifacts and to provide discriminative features for the final binary classifier. Our experiments show that the improved RawNet2 model based on vocoder identification achieves high classification performance on the binary task overall. Code and data are available at https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts
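The multi-task idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the single dense layer stands in for the RawNet2 feature extractor, and the class name `MultiTaskDetector`, the layer sizes, the number of vocoder classes, and the weighted-sum loss with `aux_weight` are all assumptions made for illustration. The abstract does not state how the two losses are combined. The sketch only shows the structural point that one shared feature vector feeds both a real/fake head and a vocoder identification head, so the auxiliary loss also shapes the shared features.

```python
import math
import random


def linear(weights, bias, x):
    """Dense layer: one output per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax_cross_entropy(logits, target):
    """Numerically stable cross-entropy of softmax(logits) against a class index."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]


def init_layer(rng, n_out, n_in):
    """Small random weights and zero biases for an n_in -> n_out dense layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


class MultiTaskDetector:
    """Hypothetical toy model: shared extractor, binary head, vocoder head."""

    def __init__(self, n_in, n_feat, n_vocoders, seed=0):
        rng = random.Random(seed)
        # Shared by both heads; a stand-in for the RawNet2 feature extractor.
        self.extractor = init_layer(rng, n_feat, n_in)
        self.binary_head = init_layer(rng, 2, n_feat)            # real vs. synthetic
        self.vocoder_head = init_layer(rng, n_vocoders, n_feat)  # which vocoder class

    def forward(self, x):
        # ReLU features computed once and reused by both heads.
        feat = [max(0.0, h) for h in linear(*self.extractor, x)]
        return linear(*self.binary_head, feat), linear(*self.vocoder_head, feat)

    def loss(self, x, is_fake, vocoder_id, aux_weight=0.5):
        # Assumed combination: binary loss plus a weighted auxiliary vocoder loss.
        bin_logits, voc_logits = self.forward(x)
        l_bin = softmax_cross_entropy(bin_logits, is_fake)
        l_voc = softmax_cross_entropy(voc_logits, vocoder_id)
        return l_bin + aux_weight * l_voc, l_bin, l_voc


# Usage on one made-up input vector (real inputs would be raw waveforms).
model = MultiTaskDetector(n_in=8, n_feat=4, n_vocoders=7)
x = [0.1 * i for i in range(8)]
bin_logits, voc_logits = model.forward(x)
total, l_bin, l_voc = model.loss(x, is_fake=1, vocoder_id=3)
```

In a trained system, gradients from both loss terms would flow into the shared extractor, which is how the pretext task constrains the features used by the binary classifier. At inference time only the binary head would be needed.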

