Neural MMO 2.0 is a massively multi-agent environment for reinforcement learning research. The key feature of this new version is a flexible task system that allows users to define a broad range of objectives and reward signals. We challenge researchers to train agents capable of generalizing to tasks, maps, and opponents never seen during training. Neural MMO features procedurally generated maps with 128 agents in the standard setting and support for up to. Version 2.0 is a complete rewrite of its predecessor with three-fold improved performance and compatibility with CleanRL. We release the platform as free and open-source software with comprehensive documentation available at neuralmmo.github.io and an active community Discord. To spark initial research on this new platform, we are concurrently running a competition at NeurIPS 2023.
翻译:Neural MMO 2.0 是一个用于强化学习研究的大规模多智能体环境。本版本的核心特性在于其灵活的任务系统,允许用户定义广泛的目标与奖励信号。我们向研究人员提出挑战:训练智能体具备泛化能力,使其能够应对训练中从未见过的任务、地图与对手。Neural MMO 支持程序化生成地图,标准设定下包含128个智能体,并可扩展至更高数量。2.0版本对前代进行了彻底重写,性能提升三倍,并与CleanRL实现兼容。我们将该平台以自由开源软件形式发布,提供完整文档(访问地址:neuralmmo.github.io),并设有活跃的社区Discord频道。为激发该新平台的初期研究,我们同步在NeurIPS 2023举办竞赛。