We introduce StructuReiser, a novel video-to-video translation method that transforms input videos into stylized sequences using a set of user-provided keyframes. Unlike existing approaches, StructuReiser maintains strict adherence to the structural elements of the target video, preserving the original identity while seamlessly applying the desired stylistic transformations. This enables a level of control and consistency that was previously unattainable with traditional text-driven or keyframe-based methods. Furthermore, StructuReiser supports real-time inference and custom keyframe editing, making it ideal for interactive applications and expanding the possibilities for creative expression and video manipulation.
翻译:本文提出StructuReiser,一种新颖的视频到视频转换方法,它利用一组用户提供的关键帧将输入视频转换为风格化序列。与现有方法不同,StructuReiser严格遵循目标视频的结构元素,在无缝应用所需风格变换的同时保持原始身份特征。这实现了传统文本驱动或基于关键帧的方法此前无法达到的控制水平与一致性。此外,StructuReiser支持实时推理与自定义关键帧编辑,使其成为交互式应用的理想选择,并拓展了创意表达与视频操纵的可能性。