We present LingBot-World, an open-sourced world simulator stemming from video generation. Positioned as a top-tier world model, LingBot-World offers the following features. (1) It maintains high fidelity and robust dynamics in a broad spectrum of environments, including realism, scientific contexts, cartoon styles, and beyond. (2) It enables a minute-level horizon while preserving contextual consistency over time, which is also known as "long-term memory". (3) It supports real-time interactivity, achieving a latency of under 1 second when producing 16 frames per second. We provide public access to the code and model in an effort to narrow the divide between open-source and closed-source technologies. We believe our release will empower the community with practical applications across areas like content creation, gaming, and robot learning.
翻译:我们提出了LingBot-World,一个源自视频生成的开源世界模拟器。作为顶级世界模型,LingBot-World具备以下特性。(1) 它在广泛的环境中保持高保真度和强健的动态特性,涵盖写实场景、科学情境、卡通风格及其他领域。(2) 它支持分钟级的时间跨度,同时保持跨时间步的上下文一致性,即具备“长期记忆”能力。(3) 它实现了实时交互性,在以每秒16帧生成视频时延迟低于1秒。我们公开了代码与模型,旨在缩小开源与闭源技术之间的差距。我们相信,此次发布将为社区在内容创作、游戏开发和机器人学习等领域带来实际应用价值。