Realistic and efficient 3D garment generation remains a longstanding challenge in computer vision and digital fashion. Existing methods typically rely on large vision- language models to produce serialized representations of 2D sewing patterns, which are then transformed into simulation-ready 3D meshes using garment modeling framework such as GarmentCode. Although these approaches yield high-quality results, they often suffer from slow inference times, ranging from 30 seconds to a minute. In this work, we introduce SwiftTailor, a novel two-stage framework that unifies sewing-pattern reasoning and geometry-based mesh synthesis through a compact geometry image representation. SwiftTailor comprises two lightweight modules: PatternMaker, an efficient vision-language model that predicts sewing patterns from diverse input modalities, and GarmentSewer, an efficient dense prediction transformer that converts these patterns into a novel Garment Geometry Image, encoding the 3D surface of all garment panels in a unified UV space. The final 3D mesh is reconstructed through an efficient inverse mapping process that incorporates remeshing and dynamic stitching algorithms to directly assemble the garment, thereby amortizing the cost of physical simulation. Extensive experiments on the Multimodal GarmentCodeData demonstrate that SwiftTailor achieves state-of-the-art accuracy and visual fidelity while significantly reducing inference time. This work offers a scalable, interpretable, and high-performance solution for next-generation 3D garment generation.
翻译:在计算机视觉和数字时尚领域,实现逼真且高效的三维服装生成仍是一项长期挑战。现有方法通常依赖大型视觉语言模型生成二维缝纫线迹的序列化表示,再通过GarmentCode等服装建模框架将其转化为可模拟的三维网格。尽管这些方法能产出高质量结果,但推理时间较长(30秒至一分钟)。为此,我们提出SwiftTailor——一种新颖的两阶段框架,通过紧凑的几何图像表示统一了缝纫线迹推理与基于几何的网格合成。SwiftTailor包含两个轻量模块:PatternMaker(一种高效的视觉语言模型,能从多种输入模态预测缝纫线迹)和GarmentSewer(一种高效密集预测Transformer,可将缝纫线迹转化为新型服装几何图像,在统一UV空间中编码所有服装面板的三维表面)。最终三维网格通过结合重网格化与动态缝合算法的高效逆映射过程直接组装服装,从而分摊物理模拟的计算成本。在多模态GarmentCodeData上的大量实验表明,SwiftTailor在显著缩短推理时间的同时,实现了最先进的精度与视觉保真度。本工作为下一代三维服装生成提供了一种可扩展、可解释且高性能的解决方案。