Film scores are considered an essential part of the film cinematic experience, but the process of film score generation is often expensive and infeasible for small-scale creators. Automating the process of film score composition would provide useful starting points for music in small projects. In this paper, we propose a two-stage pipeline for generating music from a movie script. The first phase is the Sentiment Analysis phase where the sentiment of a scene from the film script is encoded into the valence-arousal continuous space. The second phase is the Conditional Music Generation phase which takes as input the valence-arousal vector and conditionally generates piano MIDI music to match the sentiment. We study the efficacy of various music generation architectures by performing a qualitative user survey and propose methods to improve sentiment-conditioning in VAE architectures.
翻译:电影配乐被认为是电影观影体验的重要组成部分,但配乐生成过程往往成本高昂,对小型创作者而言并不可行。自动化电影配乐创作流程可为小型项目的音乐创作提供有用的起点。本文提出了一种从电影剧本生成音乐的两阶段流水线。第一阶段是情感分析阶段,将电影剧本中场景的情感编码至效价-唤醒连续空间。第二阶段是条件音乐生成阶段,以效价-唤醒向量为输入,条件生成与情感匹配的钢琴MIDI音乐。我们通过定性用户调查研究了多种音乐生成架构的有效性,并提出了在VAE架构中改进情感条件控制的方法。