Img2CAD：基于结构化视觉几何的单图像条件化三维CAD模型生成 (Img2CAD: Conditioned 3D CAD Model Generation from Single Image with Structured Visual Geometry)

In this paper, we propose Img2CAD, the first approach to our knowledge that uses 2D image inputs to generate CAD models with editable parameters. Unlike existing AI methods for 3D model generation using text or image inputs often rely on mesh-based representations, which are incompatible with CAD tools and lack editability and fine control, Img2CAD enables seamless integration between AI-based 3D reconstruction and CAD software. We have identified an innovative intermediate representation called Structured Visual Geometry (SVG), characterized by vectorized wireframes extracted from objects. This representation significantly enhances the performance of generating conditioned CAD models. Additionally, we introduce two new datasets to further support research in this area: ABC-mono, the largest known dataset comprising over 200,000 3D CAD models with rendered images, and KOCAD, the first dataset featuring real-world captured objects alongside their ground truth CAD models, supporting further research in conditioned CAD model generation.

翻译：本文提出Img2CAD方法，据我们所知，这是首个利用二维图像输入生成具有可编辑参数CAD模型的技术。与现有基于文本或图像输入、通常依赖网格表示的三维模型生成AI方法不同——这类网格表示与CAD工具不兼容且缺乏可编辑性与精细控制——Img2CAD实现了基于AI的三维重建与CAD软件间的无缝集成。我们提出了一种创新的中间表示方法，称为结构化视觉几何（SVG），其特征是从物体中提取的矢量化线框。该表示方法显著提升了生成条件化CAD模型的性能。此外，我们引入两个新数据集以进一步支持该领域研究：ABC-mono是目前已知最大的数据集，包含超过20万个附带渲染图像的三维CAD模型；KOCAD则是首个包含真实世界采集物体及其对应真实CAD模型的数据集，为条件化CAD模型生成的深入研究提供支持。

相关内容

CAD

关注 3

《计算机辅助设计》是一份领先的国际期刊，为学术界和工业界提供有关计算机应用于设计的研究和发展的重要论文。计算机辅助设计邀请论文报告新的研究以及新颖或特别重要的应用，在广泛的主题中，跨越所有阶段的设计过程，从概念创造到制造超越。官网地址：http://dblp.uni-trier.de/db/journals/cad/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日