Video colour editing is a crucial task in content creation, yet existing solutions either require painstaking frame-by-frame manipulation or produce unrealistic results with temporal artefacts. We present a practical, training-free framework that makes precise video colour editing accessible through an intuitive interface while maintaining professional-quality output. Our key insight is that decoupling the spatial and temporal aspects of colour editing better matches users' natural workflow: they first make precise colour selections in key frames, and the changes are then propagated automatically across time. The framework combines (i) a simple point-and-click interface that merges grid-based colour selection with automatic instance segmentation for precise spatial control, (ii) bidirectional colour propagation that leverages the video's inherent motion patterns, and (iii) motion-aware blending that ensures smooth transitions even under complex object motion. Through extensive evaluation on diverse scenarios, we demonstrate that our approach matches or exceeds state-of-the-art methods without requiring training or specialised hardware, making professional-quality video colour editing broadly accessible.
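To make components (ii) and (iii) more concrete, the following is a minimal sketch of how a colour propagated forward from one key frame and backward from the next might be blended at an intermediate frame. The abstract does not give the actual weighting scheme, so everything here is an assumption for illustration: the function names `blend_weights` and `blend_colour`, the linear temporal weights, and the `1 / (1 + motion)` penalty are hypothetical and are not taken from the paper.

```python
from typing import Tuple

Colour = Tuple[float, float, float]


def blend_weights(t: int, n: int,
                  motion_fwd: float = 0.0,
                  motion_bwd: float = 0.0) -> Tuple[float, float]:
    """Return (w_fwd, w_bwd) for frame t lying between key frames 0 and n.

    Hypothetical scheme: not the paper's formula.
    """
    if n <= 0 or not 0 <= t <= n:
        raise ValueError("need n > 0 and 0 <= t <= n")
    if motion_fwd < 0 or motion_bwd < 0:
        raise ValueError("motion magnitudes must be non-negative")

    # Temporal term: the nearer key frame contributes more.
    w_fwd = (n - t) / n
    w_bwd = t / n

    # Motion term (assumption): a direction that had to cross more motion
    # is trusted less, so its weight is damped.
    w_fwd /= 1.0 + motion_fwd
    w_bwd /= 1.0 + motion_bwd

    # Normalise so the two weights sum to one.
    total = w_fwd + w_bwd
    return w_fwd / total, w_bwd / total


def blend_colour(fwd: Colour, bwd: Colour,
                 w_fwd: float, w_bwd: float) -> Colour:
    """Per-channel weighted average of the two propagated colours."""
    r, g, b = (w_fwd * f + w_bwd * k for f, k in zip(fwd, bwd))
    return (r, g, b)


if __name__ == "__main__":
    red, blue = (1.0, 0.0, 0.0), (0.0, 0.0, 1.0)
    for t in range(5):
        w_f, w_b = blend_weights(t, 4)
        print(t, blend_colour(red, blue, w_f, w_b))
```

In this sketch a frame halfway between two key frames receives equal weight from both directions when the motion terms are equal; raising `motion_fwd` shifts the result toward the backward-propagated colour. A real system would apply such weights per pixel, with the motion terms estimated from the video rather than supplied by hand.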