AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario

While image-based virtual try-on has made significant strides, emerging approaches still fall short of delivering high-fidelity and robust fitting images across various scenarios, as their models suffer from issues of ill-fitted garment styles and quality degrading during the training process, not to mention the lack of support for various combinations of attire. Therefore, we first propose a lightweight, scalable, operator known as Hydra Block for attire combinations. This is achieved through a parallel attention mechanism that facilitates the feature injection of multiple garments from conditionally encoded branches into the main network. Secondly, to significantly enhance the model's robustness and expressiveness in real-world scenarios, we evolve its potential across diverse settings by synthesizing the residuals of multiple models, as well as implementing a mask region boost strategy to overcome the instability caused by information leakage in existing models. Equipped with the above design, AnyFit surpasses all baselines on high-resolution benchmarks and real-world data by a large gap, excelling in producing well-fitting garments replete with photorealistic and rich details. Furthermore, AnyFit's impressive performance on high-fidelity virtual try-ons in any scenario from any image, paves a new path for future research within the fashion community.

翻译：尽管基于图像的虚拟试穿技术已取得显著进展，但现有方法仍难以在各种场景下生成高保真且鲁棒的试穿图像。这是因为现有模型存在服装风格适配不佳、训练过程中质量退化等问题，更不用说对多样化服饰组合的支持不足。因此，我们首先提出了一种轻量级、可扩展的服饰组合算子——Hydra Block。该算子通过并行注意力机制，将来自条件编码分支的多件服装特征注入到主网络中。其次，为显著增强模型在真实场景中的鲁棒性与表现力，我们通过合成多个模型的残差来扩展其在不同场景下的潜力，并采用掩码区域增强策略以克服现有模型中因信息泄露导致的不稳定性。基于上述设计，AnyFit 在高分辨率基准测试和真实数据上均以显著优势超越所有基线方法，能够生成贴合身形、充满逼真感与丰富细节的服装图像。此外，AnyFit 在任意场景、任意图像的高保真虚拟试穿中展现出的卓越性能，为时尚领域的未来研究开辟了新路径。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日