Thanks to their generative capabilities, large language models (LLMs) have become an invaluable tool for creative processes. These models have the capacity to produce hundreds and thousands of visual and textual outputs, offering abundant inspiration for creative endeavors. But are we harnessing their full potential? We argue that current interaction paradigms fall short, guiding users towards rapid convergence on a limited set of ideas, rather than empowering them to explore the vast latent design space in generative models. To address this limitation, we propose a framework that facilitates the structured generation of design space in which users can seamlessly explore, evaluate, and synthesize a multitude of responses. We demonstrate the feasibility and usefulness of this framework through the design and development of an interactive system, Luminate, and a user study with 8 professional writers. Our work advances how we interact with LLMs for creative tasks, introducing a way to harness the creative potential of LLMs.
翻译:得益于其生成能力,大语言模型已成为创意过程中不可或缺的工具。这些模型能够生成成千上万的视觉和文本输出,为创意活动提供丰富的灵感。然而,我们是否充分发挥了它们的潜力?我们认为,当前的交互范式存在不足,它们引导用户迅速收敛于有限的想法集合,而非赋能用户去探索生成模型中蕴含的广阔潜在设计空间。为解决这一局限,我们提出一个框架,用于促进设计空间的结构化生成,使用户能够无缝地探索、评估并综合大量响应。我们通过设计并开发交互系统Luminate,以及一项针对8名专业写作者的用戶研究,验证了该框架的可行性与实用性。本研究推进了我们为创意任务与大语言模型交互的方式,引入了一种方法以充分释放大语言模型的创造潜力。