How do we evaluate experiences in immersive environments? Despite decades of research on immersive technologies such as virtual reality, the field remains fragmented. Studies rely on overlapping constructs and heterogeneous instruments, with little agreement on what counts as an immersive experience. To better understand this landscape, we conducted a bottom-up scoping review of 375 papers published in ACM CHI, UIST, VRST, SUI, IEEE VR, ISMAR, and TVCG. Our analysis reveals that evaluation practices are often domain- and purpose-specific, shaped more by local choices than by shared standards. Yet this diversity also points to new directions. Instead of multiplying instruments, researchers would benefit from integrating and refining them into smarter measures. Rather than focusing only on system outputs, evaluations must center the user's lived experience. Computational modeling offers opportunities to bridge signals across methods, but lasting progress requires open and sustainable evaluation practices that support comparability and reuse. Ultimately, our contribution is to map current practices and outline a forward-looking agenda for immersive experience research.