Design space exploration (DSE) for Text-to-Image (TTI) models entails navigating a vast, opaque space of possible image outputs, through a commensurately vast input space of hyperparameters and prompt text. Minor adjustments to prompt input can surface unexpectedly disparate images. How can interfaces support end-users in reliably steering prompt-space explorations towards interesting results? Our design probe, DreamSheets, supports exploration strategies with LLM-based functions for assisted prompt construction and simultaneous display of generated results, hosted in a spreadsheet interface. The flexible layout and novel generative functions enable experimentation with user-defined workflows. Two studies, a preliminary lab study and a longitudinal study with five expert artists, revealed a set of strategies participants use to tackle the challenges of TTI design space exploration, and the interface features required to support them - like using text-generation to define local "axes" of exploration. We distill these insights into a UI mockup to guide future interfaces.
翻译:文本到图像模型的"设计空间探索"需要用户在庞大的超参数与提示文本输入空间中导航,以探索同样广阔且不透明的可能图像输出空间。提示输入的微小调整可能导致意想不到的差异图像生成。如何设计界面以支持终端用户可靠地引导提示空间探索,从而获得有趣的结果?我们的设计探索工具DreamSheets采用电子表格界面,通过基于大语言模型的辅助提示构建功能与生成结果同步显示,支持用户探索策略。灵活的布局与新颖的生成功能使用户能够自主定义工作流程进行实验。通过初步实验室研究与五位专家艺术家的纵向研究,我们揭示了参与者应对文本到图像设计空间探索挑战时采用的一系列策略,以及支撑这些策略所需的界面特性(例如利用文本生成定义局部探索"轴")。我们将这些见解提炼为界面原型设计指南,为未来界面开发提供参考。