In this work, we present CCEdit, a versatile framework designed to address the challenges of creative and controllable video editing. CCEdit accommodates a wide spectrum of user editing requirements and enables enhanced creative control through an innovative approach that decouples video structure and appearance. We leverage the foundational ControlNet architecture to preserve structural integrity, while seamlessly integrating adaptable temporal modules compatible with state-of-the-art personalization techniques for text-to-image generation, such as DreamBooth and LoRA.Furthermore, we introduce reference-conditioned video editing, empowering users to exercise precise creative control over video editing through the more manageable process of editing key frames. Our extensive experimental evaluations confirm the exceptional functionality and editing capabilities of the proposed CCEdit framework. Demo video is available at https://www.youtube.com/watch?v=UQw4jq-igN4.
翻译:本文提出CCEdit,一个旨在解决创意可控视频编辑挑战的多功能框架。CCEdit可适配用户广泛的编辑需求,通过解耦视频结构与外观的创新方法增强创意控制能力。我们利用ControlNet基础架构保持结构完整性,同时无缝集成与最新文本到图像个性化技术(如DreamBooth和LoRA)兼容的自适应时序模块。此外,我们引入参考条件视频编辑,通过更易操作的关键帧编辑过程,使用户能够对视频编辑实现精确的创意控制。大量实验评估验证了所提CCEdit框架的卓越功能与编辑能力。演示视频见https://www.youtube.com/watch?v=UQw4jq-igN4。