How do we evaluate experiences in immersive environments? Despite decades of research on immersive technologies such as virtual reality, the field remains fragmented. Studies rely on overlapping constructs and heterogeneous instruments, and there is little agreement on what counts as an immersive experience. To better understand this landscape, we conducted a bottom-up scoping review of 375 papers published in ACM CHI, UIST, VRST, SUI, IEEE VR, ISMAR, and TVCG. Our analysis reveals that evaluation practices are often domain- and purpose-specific, shaped more by local choices than by shared standards. Yet this diversity also points to new directions. Instead of multiplying instruments, researchers would benefit from integrating and refining them into smarter measures. Rather than focusing only on system outputs, evaluations must center the user's lived experience. Computational modeling offers opportunities to bridge signals across methods, but lasting progress requires open and sustainable evaluation practices that support comparability and reuse. Ultimately, our contribution is to map current practices and outline a forward-looking agenda for immersive experience research.