There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created. To develop AI agents that can sense, think, and act as flexibly as humans in real-world settings, it is imperative to bridge the realism gap between the digital and physical worlds. How can we embody agents in an environment as rich and diverse as the one we inhabit, without the constraints imposed by real hardware and control? Towards this end, we introduce V-IRL: a platform that enables agents to scalably interact with the real world in a virtual yet realistic environment. Our platform serves as a playground for developing agents that can accomplish various practical tasks and as a vast testbed for measuring progress in capabilities spanning perception, decision-making, and interaction with real-world data across the entire globe.
翻译:人类居住的地球与创建现代AI代理的数字领域之间存在感官鸿沟。为了开发能够像人类一样灵活感知、思考并在现实环境中行动的AI代理,弥合数字世界与物理世界之间的真实性差距至关重要。如何在不受实际硬件和控制约束的情况下,让代理在与我们居住环境一样丰富多样的环境中具身化?为此,我们提出V-IRL:一个使代理能够在虚拟但逼真的环境中可扩展地与现实世界互动的平台。该平台既可作为开发能完成各类实际任务代理的试验场,也可作为衡量全球范围感知、决策及与现实数据交互等能力进展的广袤测试基地。