In video editing, sound adds character to objects and immerses the viewer in a space. Through formative interviews with professional editors (N=10), we found that adding sounds to video can be challenging. This paper presents Soundify, a system that assists editors in matching sounds to video. Given a video, Soundify identifies matching sounds, synchronizes the sounds to the video, and dynamically adjusts panning and volume to create spatial audio. In a human evaluation study (N=889), we show that Soundify can match sounds to video out-of-the-box for a diverse range of audio categories. In a within-subjects expert study (N=12), we demonstrate the usefulness of Soundify in helping video editors match sounds to video with lighter workload, reduced task completion time, and improved usability.
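To make the last step of the pipeline concrete, the following is a minimal sketch of how panning and volume could be derived from an object's position and size in each frame. It is an illustration under stated assumptions, not Soundify's actual implementation: the constant-power pan law, the square-root volume mapping, and the names `pan_gains`, `volume_gain`, and `spatialize` are all hypothetical choices introduced here.

```python
import math


def pan_gains(x_norm: float) -> tuple[float, float]:
    """Constant-power stereo gains for a horizontal position in [0, 1].

    Assumption: 0 is the left edge of the frame, 1 is the right edge.
    """
    x = min(max(x_norm, 0.0), 1.0)      # clamp to the valid range
    theta = x * math.pi / 2             # map position to [0, pi/2]
    return math.cos(theta), math.sin(theta)


def volume_gain(area_frac: float, floor: float = 0.1) -> float:
    """Map the fraction of the frame an object covers to a gain.

    Assumption: larger on-screen objects are closer and therefore louder.
    The floor keeps small objects faintly audible.
    """
    a = min(max(area_frac, 0.0), 1.0)
    return floor + (1.0 - floor) * math.sqrt(a)


def spatialize(track: list[tuple[float, float]]) -> list[tuple[float, float]]:
    """Turn per-frame (x_norm, area_frac) pairs into (left, right) gains."""
    gains = []
    for x_norm, area_frac in track:
        left, right = pan_gains(x_norm)
        vol = volume_gain(area_frac)
        gains.append((left * vol, right * vol))
    return gains
```

With a constant-power law, the summed power of the left and right channels is the same at every pan position, so a source moving across the frame does not appear to dip in loudness at the center.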