Haptic feedback contributes to immersive virtual reality (VR) experiences. However, designing such feedback at scale for all objects within a VR scene remains time-consuming. We present Scene2Hap, an LLM-centered system that automatically designs object-level vibrotactile feedback for entire VR scenes based on the objects' semantic attributes and physical context. Scene2Hap employs a multimodal large language model to estimate each object's semantics and physical context, including its material properties and vibration behavior, from multimodal information in the VR scene. These estimated attributes are then used to generate or retrieve audio signals, subsequently converted into plausible vibrotactile signals. For more realistic spatial haptic rendering, Scene2Hap estimates vibration propagation and attenuation from vibration sources to neighboring objects, considering the estimated material properties and spatial relationships of virtual objects in the scene. Three user studies confirm that Scene2Hap successfully estimates the vibration-related semantics and physical context of VR scenes and produces realistic vibrotactile signals.