MusicGen is a music generation language model (LM) that can be conditioned on textual descriptions and melodic features. We introduce MusicGen-Chord, which extends this capability by incorporating chord progression features. This model modifies one-hot encoded melody chroma vectors into multi-hot encoded chord chroma vectors, enabling the generation of music that reflects both chord progressions and textual descriptions. Furthermore, we developed MusicGen-Remixer, an application utilizing MusicGen-Chord to generate remixes of input music conditioned on textual descriptions. Both models are integrated into Replicate's web-UI using cog, facilitating broad accessibility and user-friendly controllable interaction for creating and experiencing AI-generated music.
翻译:MusicGen是一种可基于文本描述和旋律特征进行条件控制的音乐生成语言模型(LM)。本文提出的MusicGen-Chord通过引入和弦进行特征扩展了该能力。该模型将独热编码的旋律色谱向量修改为多热编码的和弦色谱向量,从而能够生成同时反映和弦进行与文本描述的音乐。此外,我们开发了MusicGen-Remixer应用程序,该程序利用MusicGen-Chord生成基于文本描述条件控制的输入音乐混音版本。两款模型均通过cog集成至Replicate平台的web-UI,为创建和体验AI生成音乐提供了广泛的访问途径与用户友好的可控交互体验。