Extended reality (XR) enables new music-mixing workflows by moving beyond 2D faders toward embodied, spatial interaction. However, it remains unclear which six-degree-of-freedom (6DoF) gestures align with real-world mixing practices and whether such interactions support manageable cognitive load and positive user experience. We conducted a design workshop with experienced mixers to elicit gesture concepts for core audio tasks gain, compression, equalization, and automation, and implemented these in an XR prototype. A user study (n=12) evaluated the ecological validity of the gestures using cognitive load measures, user-experience ratings, and interviews. Participants generally found 6DoF gestures intuitive and well-mapped to mixing tasks, reporting strong immersion and a sense of connection with the audio environment. Cognitive load differences across gestures were minimal, though participants expressed preferences shaped by workflow familiarity and perceived control. We discuss implications for designing XR mixing tools that balance expressiveness, precision, and ecological validity.
翻译:扩展现实(XR)通过超越二维推子,实现具身化、空间化的交互,为音乐混音工作流带来了新的可能性。然而,目前尚不清楚哪些六自由度(6DoF)手势符合实际混音实践,以及此类交互是否能够支持可控的认知负荷与积极的用户体验。我们与经验丰富的混音师开展了一场设计研讨会,以激发针对核心音频任务(增益、压缩、均衡与自动化)的手势概念,并在一个XR原型中实现了这些手势。一项用户研究(n=12)通过认知负荷测量、用户体验评分及访谈,评估了这些手势的生态效度。参与者普遍认为6DoF手势直观且与混音任务映射良好,并报告了强烈的沉浸感以及与音频环境的连接感。不同手势间的认知负荷差异较小,但参与者的偏好受到工作流熟悉度和感知控制度的影响。我们讨论了设计XR混音工具时在表现力、精确度和生态效度之间取得平衡的启示。