In the art of video editing, sound helps add character to an object and immerse the viewer within a space. Through formative interviews with professional editors (N=10), we found that the task of adding sounds to video can be challenging. This paper presents Soundify, a system that assists editors in matching sounds to video. Given a video, Soundify identifies matching sounds, synchronizes the sounds to the video, and dynamically adjusts panning and volume to create spatial audio. In a human evaluation study (N=889), we show that Soundify is capable of matching sounds to video out-of-the-box for a diverse range of audio categories. In a within-subjects expert study (N=12), we demonstrate the usefulness of Soundify in helping video editors match sounds to video with lighter workload, reduced task completion time, and improved usability.
翻译:在视频剪辑艺术中,声音有助于为物体增添特色,让观众沉浸于场景之中。通过与专业剪辑师的前期访谈(N=10),我们发现为视频添加音效是一项具有挑战性的任务。本文提出Soundify系统,可辅助剪辑师完成音效与视频的匹配。给定一段视频,Soundify能自动识别匹配的音效,将其与视频同步,并动态调整声像定位与音量以营造空间音频效果。在人类评估研究(N=889)中,我们证明Soundify能够开箱即用地为多种音频类别匹配视频音效。在一项受试者内专家研究(N=12)中,我们验证了Soundify在帮助视频剪辑师以更低工作量、更短任务完成时间和更优可用性匹配音效方面的实用性。