The recent progress of text-to-image generation has been recognized in architectural design. Our study is the first to investigate the potential of text-to-image generators in supporting creativity during the early stages of the architectural design process. We conducted a laboratory study with 17 architecture students, who developed a concept for a culture center using three popular text-to-image generators: Midjourney, Stable Diffusion, and DALL-E. Through standardized questionnaires and group interviews, we found that image generation could be a meaningful part of the design process when design constraints are carefully considered. Generative tools support serendipitous discovery of ideas and an imaginative mindset, enriching the design process. We identified several challenges of image generators and provided considerations for software development and educators to support creativity and emphasize designers' imaginative mindset. By understanding the limitations and potential of text-to-image generators, architects and designers can leverage this technology in their design process and education, facilitating innovation and effective communication of concepts.
翻译:文本到图像生成技术的最新进展已在建筑设计领域得到认可。本研究首次探讨了文本到图像生成器在建筑设计过程早期阶段支持创造力的潜力。我们进行了一项实验室研究,邀请17名建筑系学生,使用三种流行的文本到图像生成器(Midjourney、Stable Diffusion和DALL-E)为文化中心开发概念方案。通过标准化问卷和小组访谈,我们发现当设计约束得到仔细考虑时,图像生成可以成为设计过程中有意义的一部分。生成工具支持偶然发现想法和富有想象力的思维方式,丰富了设计过程。我们识别了图像生成器的若干挑战,并为软件开发者和教育者提供了建议,以支持创造力并强调设计师的想象力思维。通过理解文本到图像生成器的局限性和潜力,建筑师和设计师可以在他们的设计过程和教育中利用这一技术,促进创新和概念的有效沟通。