MusicLM: Generating Music From Text

Andrea Agostinelli,Timo I. Denk,Zalán Borsos,Jesse Engel,Mauro Verzetti,Antoine Caillon,Qingqing Huang,Aren Jansen,Adam Roberts,Marco Tagliasacchi,Matt Sharifi,Neil Zeghidour,Christian Frank

from arxiv, Supplementary material at https://google-research.github.io/seanet/musiclm/examples and https://kaggle.com/datasets/googleai/musiccaps

We introduce MusicLM, a model generating high-fidelity music from text descriptions such as "a calming violin melody backed by a distorted guitar riff". MusicLM casts the process of conditional music generation as a hierarchical sequence-to-sequence modeling task, and it generates music at 24 kHz that remains consistent over several minutes. Our experiments show that MusicLM outperforms previous systems both in audio quality and adherence to the text description. Moreover, we demonstrate that MusicLM can be conditioned on both text and a melody in that it can transform whistled and hummed melodies according to the style described in a text caption. To support future research, we publicly release MusicCaps, a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts.

翻译：我们提出了MusicLM，这是一个能够根据文本描述（例如“被扭曲吉他即兴段衬托的宁静小提琴旋律”）生成高保真音乐的模型。MusicLM将条件音乐生成过程建模为层次化的序列到序列任务，并以24 kHz的采样率生成持续数分钟且保持一致的音频。实验表明，MusicLM在音频质量和对文本描述的遵循程度上均优于先前系统。此外，我们展示了MusicLM可以同时以文本和旋律为条件，从而能够根据文本说明中描述的风格转换吹口哨或哼唱的旋律。为支持未来研究，我们公开了MusicCaps数据集，该数据集包含5500对音乐-文本描述，并由人类专家提供了丰富的文本说明。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

67页PPT【ML+气象】使用机器学习技术对季节和次季节研究和预测，Use of Machine Learning Techniques for Seasonal and Subseasonal Studies and Predictions

专知会员服务

19+阅读 · 2022年3月4日

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【ETH】最新《几何数据分析》2020课程，附PPT下载

专知会员服务

45+阅读 · 2020年12月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日