Ideation is a critical component of video-based design (VBD), where videos serve as the primary medium for design exploration and inspiration. The emergence of generative AI offers considerable potential to enhance this process by streamlining video analysis and facilitating idea generation. In this paper, we present DesignMinds, a prototype that integrates a state-of-the-art Vision-Language Model (VLM) with a context-enhanced Large Language Model (LLM) to support ideation in VBD. To evaluate DesignMinds, we conducted a between-subjects study with 35 design practitioners, comparing it against a baseline condition. Our results show that DesignMinds significantly enhances the flexibility and originality of ideation while also increasing task engagement. Importantly, introducing this technology did not negatively affect user experience, technology acceptance, or usability.