Leveraging Neural Representations for Audio Manipulation

We investigate whether audio manipulations can be applied through pretrained neural network-based autoencoders as an alternative to traditional signal processing methods, since such models may offer greater semantic or perceptual organization. To assess the potential of this approach, we first test whether the representations from these models encode information about manipulations. We carry out experiments and produce visualizations using representations from two different pretrained autoencoders. Our findings indicate that, while some information about audio manipulations is encoded, this information is both limited and encoded in a non-trivial way. This is supported by our visualizations of these representations, which show that the trajectories of representations under common manipulations are typically nonlinear and content dependent, even for linear signal manipulations. As a result, it is not yet clear how these pretrained autoencoders can be used to manipulate audio signals; however, our results indicate that this may be due to a lack of disentanglement with respect to common audio manipulations.
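To make the idea of a "trajectory of representations" concrete, the sketch below sweeps a gain (a linear signal manipulation) over a test tone and measures how close the resulting path in representation space is to a straight line. Everything in it is an assumption made for illustration: the random projection `w`, the `tanh` nonlinearity standing in for an encoder, the 440 Hz tone, and the `line_fit_ratio` helper are hypothetical and are not the pretrained autoencoders or the analysis used in the paper.

```python
import numpy as np

def line_fit_ratio(points):
    """Fraction of a trajectory's variance explained by its best-fit straight line."""
    centered = points - points.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    return float(singular_values[0] ** 2 / np.sum(singular_values ** 2))

rng = np.random.default_rng(0)

# Hypothetical input: a 440 Hz tone sampled at 16 kHz.
signal = np.sin(2 * np.pi * 440 * np.arange(1024) / 16000)

# Toy stand-in for an encoder: a fixed random projection to 16 dimensions.
w = rng.normal(size=(16, 1024)) / np.sqrt(1024)

# The manipulation: scale the signal by a range of gains.
gains = np.linspace(0.1, 4.0, 20)

# A purely linear encoder maps the gain sweep onto a straight line.
linear_traj = np.stack([w @ (g * signal) for g in gains])

# Adding a nonlinearity bends the trajectory, even though the
# manipulation itself is linear in the signal domain.
nonlinear_traj = np.stack([np.tanh(w @ (g * signal)) for g in gains])

linear_ratio = line_fit_ratio(linear_traj)
nonlinear_ratio = line_fit_ratio(nonlinear_traj)
print(f"linear encoder:    {linear_ratio:.6f}")
print(f"nonlinear encoder: {nonlinear_ratio:.6f}")
```

A ratio of 1 means the trajectory lies on a line; lower values mean it curves. Repeating the sweep with a different input signal would give a rough check of content dependence in the same toy setting.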


Related content

Autoencoders


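An autoencoder is an artificial neural network used to learn efficient data encodings in an unsupervised manner. Its aim is to learn a representation (encoding) of a set of data, typically for dimensionality reduction, by training the network to ignore signal "noise". Alongside the reduction side, a reconstruction side is learned, in which the autoencoder tries to generate from the reduced encoding an output as close as possible to its original input, hence the name. Several variants of the basic model exist, each intended to force the learned representation of the input to take on useful properties. Autoencoders are used effectively in many applications, from face recognition to capturing the semantic meaning of words.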
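The encode-then-reconstruct idea described above can be sketched with a very small linear autoencoder trained by plain gradient descent. This is a minimal illustration under stated assumptions, not a practical model: the function name `train_autoencoder`, the synthetic data, the code size, and the learning rate are all hypothetical choices.

```python
import numpy as np

def train_autoencoder(x, code_dim=2, lr=0.05, steps=500, seed=0):
    """Train a linear autoencoder x -> z -> x_hat by gradient descent on squared error."""
    rng = np.random.default_rng(seed)
    n, d = x.shape
    w_enc = rng.normal(scale=0.1, size=(d, code_dim))  # encoder weights
    w_dec = rng.normal(scale=0.1, size=(code_dim, d))  # decoder weights
    losses = []
    for _ in range(steps):
        z = x @ w_enc          # reduced encoding
        x_hat = z @ w_dec      # reconstruction of the input
        err = x_hat - x
        losses.append(float(np.mean(err ** 2)))
        # Gradients of the squared reconstruction error (constant factors
        # are absorbed into the learning rate).
        grad_dec = z.T @ err / n
        grad_enc = x.T @ (err @ w_dec.T) / n
        w_dec -= lr * grad_dec
        w_enc -= lr * grad_enc
    return w_enc, w_dec, losses

# Synthetic data: 8-dimensional observations driven by 2 hidden factors plus noise.
rng = np.random.default_rng(1)
latent = rng.normal(size=(200, 2))
mixing = rng.normal(size=(2, 8))
x = latent @ mixing + 0.01 * rng.normal(size=(200, 8))
x = (x - x.mean(axis=0)) / x.std(axis=0)  # standardize each feature

w_enc, w_dec, losses = train_autoencoder(x)
print(f"initial loss: {losses[0]:.4f}, final loss: {losses[-1]:.4f}")
```

Because the data here have only two underlying factors, a two-dimensional code is enough to reconstruct them well; real autoencoders add nonlinear layers and, in the variants mentioned above, extra constraints on the code.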
