Graphic design is a creative and innovative process that plays a crucial role in applications such as e-commerce and advertising. However, developing an automated design system that can faithfully translate user intentions into editable design files remains an open challenge. Although recent studies have leveraged powerful text-to-image models and MLLMs to assist graphic design, they typically simplify professional workflows, resulting in limited flexibility and intuitiveness. To address these limitations, we propose PSDesigner, an automated graphic design system that emulates the creative workflow of human designers. Building upon multiple specialized components, PSDesigner collects theme-related assets based on user instructions, and autonomously infers and executes tool calls to manipulate design files, such as integrating new assets or refining inferior elements. To endow the system with strong tool-use capabilities, we construct a design dataset, CreativePSD, which contains a large amount of high-quality PSD design files annotated with operation traces across a wide range of design scenarios and artistic styles, enabling models to learn expert design procedures. Extensive experiments demonstrate that PSDesigner outperforms existing methods across diverse graphic design tasks, empowering non-specialists to conveniently create production-quality designs.
翻译:图形设计是一种富有创造性和创新性的过程,在电子商务和广告等应用中发挥着至关重要的作用。然而,开发一个能够忠实地将用户意图转化为可编辑设计文件的自动化设计系统,仍然是一个开放性的挑战。尽管近期研究已利用强大的文生图模型和多模态大语言模型(MLLMs)来辅助图形设计,但这些方法通常简化了专业工作流程,导致灵活性和直观性受限。为解决这些局限,我们提出了PSDesigner——一种模拟人类设计师创意工作流程的自动化图形设计系统。该系统基于多个专业化组件,根据用户指令收集与主题相关的素材,并自主推断和执行工具调用来操作设计文件,例如整合新素材或优化劣质元素。为使系统具备强大的工具使用能力,我们构建了一个设计数据集CreativePSD,其中包含大量标注有操作轨迹的高质量PSD设计文件,覆盖广泛的设计场景和艺术风格,使模型能够学习专家级的设计流程。大量实验表明,PSDesigner在多种图形设计任务上优于现有方法,使非专业人士也能便捷地创作出具有专业品质的设计作品。