While large language models have accelerated software development through "vibe coding", prototyping intelligent Extended Reality (XR) experiences remains inaccessible due to the friction of complex game engines and low-level sensor integration. To bridge this gap, we contribute XR Blocks, an open-source, modular WebXR framework that abstracts spatial computing complexities into high-level, human-centered primitives. Building upon this foundation, we present Vibe Coding XR, an end-to-end rapid prototyping workflow that leverages LLMs to translate natural language intent directly into functional XR software. Using a web-based interface, creators can transform high-level prompts (e.g., "create a dandelion that reacts to hand") into interactive WebXR applications in under a minute. We provide a preliminary technical evaluation on a pilot dataset (VCXR60) alongside diverse application scenarios highlighting mixed-reality realism, multi-modal interaction, and generative AI integrations. By democratizing spatial software creation, this work empowers practitioners to bypass low-level hurdles and rapidly move from "idea to reality." Code and live demos are available at https://xrblocks.github.io/gem and https://github.com/google/xrblocks.
翻译:尽管大语言模型已通过“氛围编程”加速了软件开发,但智能扩展现实(XR)体验的原型设计仍因复杂游戏引擎和底层传感器集成的摩擦而难以普及。为弥合这一鸿沟,我们提出了XR Blocks——一个开源、模块化的WebXR框架,将空间计算的复杂性抽象为高层级、以人为中心的原语。在此基础上,我们进一步提出Vibe Coding XR——一个端到端的快速原型开发工作流,利用大语言模型将自然语言意图直接转化为可运行的XR软件。借助基于Web的界面,创作者可在不到一分钟内将高层级提示(例如“生成一朵对手部动作产生反应的蒲公英”)转化为交互式WebXR应用。我们在初步数据集(VCXR60)上完成了技术评估,并展示了涵盖混合现实真实感、多模态交互与生成式AI整合的多样化应用场景。通过推进空间软件的民主化进程,本工作使从业者能够绕过底层障碍,快速实现从“创意到现实”的跨越。代码与在线演示详见https://xrblocks.github.io/gem 及 https://github.com/google/xrblocks。