In the art of video editing, sound helps add character to an object and immerse the viewer within a space. Through formative interviews with professional editors (N=10), we found that the task of adding sounds to video can be challenging. This paper presents Soundify, a system that assists editors in matching sounds to video. Given a video, Soundify identifies matching sounds, synchronizes the sounds to the video, and dynamically adjusts panning and volume to create spatial audio. In a human evaluation study (N=889), we show that Soundify is capable of matching sounds to video out-of-the-box for a diverse range of audio categories. In a within-subjects expert study (N=12), we demonstrate the usefulness of Soundify in helping video editors match sounds to video with lighter workload, reduced task completion time, and improved usability.
翻译:在视频剪辑艺术中,声音有助于为物体增添特性并使观众沉浸于场景之中。通过对专业剪辑师(N=10)的初步访谈,我们发现为视频添加声音的任务可能具有挑战性。本文提出Soundify系统,旨在辅助剪辑师为视频匹配声音。给定一段视频,Soundify能够识别匹配的音效,将音效与视频同步,并动态调整声像定位与音量以创建空间音频。在一项人类评估研究(N=889)中,我们证明Soundify能够为多种音频类别实现开箱即用的视频音效匹配。在一项被试内专家研究(N=12)中,我们验证了Soundify在帮助视频剪辑师匹配音效方面的实用性,其能减轻工作负荷、缩短任务完成时间并提升可用性。