Clarify Before You Draw: Proactive Agents for Robust Text-to-CAD Generation

Large language models have recently enabled text-to-CAD systems that synthesize parametric CAD programs (e.g., CadQuery) from natural-language prompts. In practice, however, geometric descriptions can be under-specified or internally inconsistent: critical dimensions may be missing and constraints may conflict. Existing fine-tuned models nonetheless tend to follow user instructions reactively and hallucinate dimensions when the text is ambiguous. To address this, we propose ProCAD, a proactive agentic framework for text-to-CadQuery generation that resolves specification issues before code synthesis. Our framework pairs a proactive clarifying agent, which audits the prompt and asks targeted clarification questions only when necessary to produce a self-consistent specification, with a CAD coding agent that translates the specification into an executable CadQuery program. We fine-tune the coding agent on a curated, high-quality text-to-CadQuery dataset and train the clarifying agent via agentic SFT on clarification trajectories. Experiments show that proactive clarification significantly improves robustness to ambiguous prompts while keeping interaction overhead low. ProCAD outperforms frontier closed-source models, including Claude Sonnet 4.5, reducing the mean Chamfer distance by 79.9% and lowering the invalidity ratio from 4.8% to 0.9%. Our code and datasets are publicly available at https://github.com/BoYuanVisionary/Pro-CAD.
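
The clarify-then-code flow described in the abstract can be illustrated with a toy sketch. This is not the ProCAD implementation: the abstract describes fine-tuned language-model agents, whereas the sketch below substitutes a regular-expression audit and a string template purely to show the control flow. The names `REQUIRED`, `parse_dimensions`, `clarify`, and `generate_cadquery` are hypothetical, the sketch assumes a single box primitive, and it covers only missing dimensions, not conflicting constraints. The emitted program uses CadQuery's `Workplane("XY").box(length, width, height)` call but is only produced as text here, so CadQuery does not need to be installed.

```python
import re
from typing import Callable, Dict

# Hypothetical: dimensions a single box primitive needs (illustrative only).
REQUIRED = ("length", "width", "height")


def parse_dimensions(prompt: str) -> Dict[str, float]:
    """Audit step: pull out whichever required dimensions the prompt states."""
    found: Dict[str, float] = {}
    for name in REQUIRED:
        # Matches forms such as "length 40", "length = 40", "length of 40".
        match = re.search(
            rf"{name}\s*(?:=|of|is)?\s*(\d+(?:\.\d+)?)", prompt, re.IGNORECASE
        )
        if match:
            found[name] = float(match.group(1))
    return found


def clarify(prompt: str, ask: Callable[[str], str]) -> Dict[str, float]:
    """Ask one question per missing dimension, and none if the prompt is complete."""
    spec = parse_dimensions(prompt)
    for name in REQUIRED:
        if name not in spec:
            spec[name] = float(ask(f"What is the {name} of the box?"))
    return spec


def generate_cadquery(spec: Dict[str, float]) -> str:
    """Coding step: turn a complete specification into CadQuery source text."""
    return (
        "import cadquery as cq\n"
        'result = cq.Workplane("XY").box({length}, {width}, {height})\n'
    ).format(**spec)


if __name__ == "__main__":
    # A scripted "user" that answers every clarification question with 10.
    spec = clarify("Make a box with length 40 and width 20.", lambda q: "10")
    print(generate_cadquery(spec))
```

Passing the user as a callback (`ask`) keeps the sketch testable and makes the interaction overhead easy to measure: the number of calls to `ask` is the number of clarification questions, which is zero when the prompt already states every required dimension.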
