We present Fashion-VDM, a video diffusion model (VDM) for generating virtual try-on videos. Given an input garment image and person video, our method aims to generate a high-quality try-on video of the person wearing the given garment, while preserving the person's identity and motion. Image-based virtual try-on has shown impressive results; however, existing video virtual try-on (VVT) methods still lack garment detail and temporal consistency. To address these issues, we propose a diffusion-based architecture for video virtual try-on, split classifier-free guidance for increased control over the conditioning inputs, and a progressive temporal training strategy for single-pass 64-frame, 512px video generation. We also demonstrate the effectiveness of joint image-video training for video try-on, especially when video data is limited. Our qualitative and quantitative experiments show that our approach sets a new state of the art for video virtual try-on. For additional results, visit our project page: https://johannakarras.github.io/Fashion-VDM.
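To make the "split classifier-free guidance" idea concrete, below is a minimal sketch of one common way to decompose guidance across multiple conditions: the denoiser is queried with conditions progressively dropped, and each condition (garment image, person video) receives its own guidance weight. The function name `split_cfg`, the weight values, the decomposition order, and the latent shape are illustrative assumptions, not necessarily the paper's exact formulation.

```python
import numpy as np

def split_cfg(eps_uncond, eps_garment, eps_full, w_garment=2.0, w_person=1.5):
    """Combine three denoiser predictions into one guided prediction.

    eps_uncond  -- prediction with all conditioning dropped
    eps_garment -- prediction conditioned on the garment image only
    eps_full    -- prediction conditioned on garment image + person video

    Each condition gets an independent guidance weight, allowing separate
    control over garment fidelity and person/motion adherence.
    """
    return (eps_uncond
            + w_garment * (eps_garment - eps_uncond)
            + w_person * (eps_full - eps_garment))

# Toy usage with random stand-ins for the model's noise predictions.
shape = (64, 4, 64, 64)  # frames x latent channels x H x W (hypothetical)
eps_u, eps_g, eps_f = (np.random.randn(*shape) for _ in range(3))
eps_guided = split_cfg(eps_u, eps_g, eps_f)
print(eps_guided.shape)  # (64, 4, 64, 64)
```

Raising `w_garment` relative to `w_person` would push the sample toward the garment condition, which is the kind of per-input control the abstract attributes to the split guidance scheme.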