Diagrams are crucial for communicating complex information, yet creating and modifying them remains a labor-intensive task. We present GenAI-DrawIO-Creator, a novel framework that leverages Large Language Models (LLMs) to automate diagram generation and manipulation in the structured XML format used by draw.io. Our system integrates Claude 3.7 to reason about structured visual data and produce valid diagram representations. Key contributions include a high-level system design enabling real-time diagram updates, specialized prompt engineering and error-checking to ensure well-formed XML outputs. We demonstrate a working prototype capable of generating accurate diagrams (such as network architectures and flowcharts) from natural language or code, and even replicating diagrams from images. Simulated evaluations show that our approach significantly reduces diagram creation time and produces outputs with high structural fidelity. Our results highlight the promise of Claude 3.7 in handling structured visual reasoning tasks and lay the groundwork for future research in AI-assisted diagramming applications.
翻译:图表对于传达复杂信息至关重要,但创建和修改图表仍然是一项劳动密集型任务。我们提出了GenAI-DrawIO-Creator,这是一种新颖的框架,利用大型语言模型(LLMs)自动生成和操作draw.io所使用的结构化XML格式图表。我们的系统集成了Claude 3.7,用于推理结构化视觉数据并生成有效的图表表示。主要贡献包括:支持实时图表更新的高级系统设计、专门的提示工程和错误检查机制,以确保生成格式良好的XML输出。我们展示了一个工作原型,该原型能够根据自然语言或代码生成准确的图表(例如网络架构和流程图),甚至能够从图像中复制图表。模拟评估表明,我们的方法显著减少了图表创建时间,并生成了具有高结构保真度的输出。我们的结果凸显了Claude 3.7在处理结构化视觉推理任务方面的潜力,并为未来AI辅助图表绘制应用的研究奠定了基础。