We study object motion path editing in videos, where the goal is to alter a target object's trajectory while preserving the original scene content. Unlike prior video editing methods that primarily manipulate appearance or rely on point-track-based trajectory control, which is often challenging for users to provide during inference, especially in videos with camera motion, we offer a practical, easy-to-use approach to controllable object-centric motion editing. We present Trace, a framework that enables users to design the desired trajectory in a single anchor frame and then synthesizes a temporally consistent edited video. Our approach addresses this task with a two-stage pipeline: a cross-view motion transformation module that maps first-frame path design to frame-aligned box trajectories under camera motion, and a motion-conditioned video re-synthesis module that follows these trajectories to regenerate the object while preserving the remaining content of the input video. Experiments on diverse real-world videos show that our method produces more coherent, realistic, and controllable motion edits than recent image-to-video and video-to-video methods.
翻译:我们研究视频中的目标运动路径编辑问题,目标是在保留原始场景内容的同时改变目标物体的运动轨迹。与主要操控外观或依赖基于点跟踪的轨迹控制(此类方法在推理时往往难以让用户提供信息,尤其是存在相机运动的视频中)等先前的视频编辑方法不同,我们提出一种实用且易于操作的物体中心运动可控编辑方法。我们提出Trace框架,使用户能够在单个锚定帧中设计期望轨迹,然后合成时间一致性编辑视频。该方法采用两阶段流程实现:跨视角运动变换模块将首帧路径设计映射为相机运动下的帧对齐框轨迹,以及运动条件视频重合成模块跟随这些轨迹重新生成目标物体,同时保留输入视频的其余内容。在多样化真实视频上的实验表明,与近期图像到视频和视频到视频方法相比,本方法能产生更连贯、真实且可控的运动编辑效果。