Information graphics are pivotal in effective information dissemination and storytelling. However, creating such graphics is extremely challenging for non-professionals, since the design process requires multifaceted skills and comprehensive knowledge. Thus, despite the many available authoring tools, a significant gap remains in enabling non-experts to produce compelling information graphics seamlessly, especially from scratch. Recent breakthroughs show that Large Language Models (LLMs), especially when tool-augmented, can autonomously engage with external tools, making them promising candidates for enabling innovative graphic design applications. In this work, we propose a LLM-centric interface with the agent GraphiMind for automatic generation, recommendation, and composition of information graphics design resources, based on user intent expressed through natural language. Our GraphiMind integrates a Textual Conversational Interface, powered by tool-augmented LLM, with a traditional Graphical Manipulation Interface, streamlining the entire design process from raw resource curation to composition and refinement. Extensive evaluations highlight our tool's proficiency in simplifying the design process, opening avenues for its use by non-professional users. Moreover, we spotlight the potential of LLMs in reshaping the domain of information graphics design, offering a blend of automation, versatility, and user-centric interactivity.
翻译:信息图形在有效信息传播与叙事中至关重要。然而,设计此类图形对非专业人士极具挑战,因为设计过程需要多面技能与综合知识。因此,尽管存在众多创作工具,但让非专家用户无缝生成高质量信息图形(尤其是从零开始)仍存在显著鸿沟。近期突破表明,大型语言模型(LLM),特别是当增强工具能力后,可自主与外部工具交互,成为实现创新图形设计应用的理想候选。本文提出一种以LLM为中心的界面,其智能体GraphiMind基于用户通过自然语言表达的意图,自动生成、推荐与组合信息图形设计资源。我们的GraphiMind将基于工具增强LLM驱动的文本对话界面与传统图形操作界面相融合,精简从原始资源整理到组合优化的全设计流程。广泛评估凸显了本工具简化设计流程的能力,为非专业用户应用开辟了新途径。此外,我们揭示了LLM在重塑信息图形设计领域的潜力——融合自动化、多功能性与用户中心交互。