Virtual environments provide a rich, controlled setting for collecting detailed data on human behavior, offering unique opportunities for predicting human trajectories in dynamic scenes. However, most existing approaches have overlooked the potential of these environments, focusing instead on static contexts without considering user-specific factors. Using the CREATTIVE3D dataset, we model trajectories recorded in virtual reality (VR) scenes across diverse situations, including road-crossing tasks with user interactions and simulated visual impairments. We propose Diverse Context VR Human Motion Prediction (DiVR), a cross-modal transformer based on the Perceiver architecture that integrates both static and dynamic scene context through a heterogeneous graph convolution network. We conduct extensive experiments comparing DiVR against existing architectures, including MLPs, LSTMs, and transformers with gaze and point-cloud context, and we stress-test our model's generalizability across different users, tasks, and scenes. Results show that DiVR achieves higher accuracy and adaptability than competing models and than static graphs. This work highlights the advantages of VR datasets for context-aware human trajectory modeling, with potential applications in enhancing user experiences in the metaverse. Our source code is publicly available at https://gitlab.inria.fr/ffrancog/creattive3d-divr-model.
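To make the cross-modal integration concrete, the following is a minimal NumPy sketch of Perceiver-style cross-attention, in which a small latent array attends over a concatenation of trajectory tokens and scene-graph node embeddings. This is an illustrative toy, not the DiVR implementation: the dimensions, token counts, and random projection weights are hypothetical, and the real model additionally uses a heterogeneous graph convolution network to produce the scene embeddings.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(latents, context, rng):
    """One Perceiver-style cross-attention step.

    latents: (L, d) learned latent array (the bottleneck).
    context: (M, d) input tokens; M can be much larger than L.
    """
    d = latents.shape[1]
    # Random projections stand in for learned weights (illustration only).
    Wq = rng.standard_normal((d, d)) / np.sqrt(d)
    Wk = rng.standard_normal((d, d)) / np.sqrt(d)
    Wv = rng.standard_normal((d, d)) / np.sqrt(d)
    q, k, v = latents @ Wq, context @ Wk, context @ Wv
    attn = softmax(q @ k.T / np.sqrt(d))  # (L, M): each latent attends to all tokens
    return attn @ v                        # (L, d): fixed-size summary of the context

rng = np.random.default_rng(0)
d = 8
latents = rng.standard_normal((4, d))        # 4 latents, independent of input size
traj_tokens = rng.standard_normal((10, d))   # hypothetical past-trajectory embeddings
scene_tokens = rng.standard_normal((20, d))  # hypothetical scene-graph node embeddings
context = np.concatenate([traj_tokens, scene_tokens], axis=0)  # cross-modal token set
out = cross_attention(latents, context, rng)
print(out.shape)  # latent summary: (4, 8)
```

The design point this illustrates is why a Perceiver backbone suits variable-size context: the attention cost scales with the number of context tokens times the fixed number of latents, so adding more scene-graph nodes or longer trajectory histories does not change the output size.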