Therapeutic art activities, such as expressive drawing and painting, require the synergy between creative visual production and interactive dialogue. Recent advancements in Multimodal Large Language Models (MLLMs) have expanded the capacity of computing systems to interpret both textual and visual data, offering a new frontier for AI-mediated therapeutic support. This work-in-progress paper introduces an MLLM-powered chatbot that analyzes visual creation in real-time while engaging the creator in reflective conversations. We conducted an evaluation with five experts in art therapy and related fields, which demonstrated the chatbot's potential to facilitate therapeutic engagement, and highlighted several areas for future development, including entryways and risk management, bespoke alignment of user profile and therapeutic style, balancing conversational depth and width, and enriching visual interactivity. These themes provide a design roadmap for designing the future AI-mediated creative expression tools.
翻译:治疗性艺术活动,如表达性绘画与涂色,需要创造性视觉产出与互动对话之间的协同作用。多模态大语言模型(MLLMs)的最新进展扩展了计算系统解析文本与视觉数据的能力,为人工智能介导的治疗支持开辟了新前沿。这篇进行中的研究论文介绍了一款由MLLM驱动的聊天机器人,它能够实时分析视觉创作,同时引导创作者进行反思性对话。我们邀请了五位艺术治疗及相关领域的专家进行评估,结果表明该聊天机器人具有促进治疗性参与的潜力,并指出了未来发展的若干方向,包括介入途径与风险管理、用户画像与治疗风格的定制化匹配、对话深度与广度的平衡,以及视觉交互性的丰富。这些主题为设计未来人工智能介导的创造性表达工具提供了设计路线图。