NavAI: A Generalizable LLM Framework for Navigation Tasks in Virtual Reality Environments

Navigation is one of the fundamental tasks for automated exploration in Virtual Reality (VR). Existing technologies primarily focus on path optimization in 360-degree image datasets and 3D simulators, which cannot be directly applied to immersive VR environments. To address this gap, we present NavAI, a generalizable large language model (LLM)-based navigation framework that supports both basic actions and complex goal-directed tasks across diverse VR applications. We evaluate NavAI in three distinct VR environments through goal-oriented and exploratory tasks. Results show that it achieves high accuracy, with an 89% success rate in goal-oriented tasks. Our analysis also highlights current limitations of relying entirely on LLMs, particularly in scenarios that require dynamic goal assessment. Finally, we discuss the limitations observed during the experiments and offer insights for future research directions.
