AI agents have been evaluated in isolation or within small groups, where interactions remain limited in scope and complexity. Large-scale simulations involving many autonomous agents -- reflecting the full spectrum of civilizational processes -- have yet to be explored. Here, we demonstrate how 10 - 1000+ AI agents behave and progress within agent societies. We first introduce the PIANO (Parallel Information Aggregation via Neural Orchestration) architecture, which enables agents to interact with humans and other agents in real-time while maintaining coherence across multiple output streams. We then evaluate agent performance in agent simulations using civilizational benchmarks inspired by human history. These simulations, set within a Minecraft environment, reveal that agents are capable of meaningful progress -- autonomously developing specialized roles, adhering to and changing collective rules, and engaging in cultural and religious transmission. These preliminary results show that agents can achieve significant milestones towards AI civilizations, opening new avenues for large simulations, agentic organizational intelligence, and integrating AI into human civilizations.
翻译:当前AI智能体的评估通常局限于孤立个体或小规模群体,其交互的范围与复杂性均受到限制。尚未有研究探索涉及大量自主智能体的大规模模拟——这种模拟能够反映文明进程的完整谱系。本文展示了10至1000个以上AI智能体在智能体社会中的行为模式与发展进程。我们首先提出了PIANO(基于神经编排的并行信息聚合)架构,该架构使智能体能够与人类及其他智能体实时交互,同时保持跨多个输出流的一致性。随后,我们采用受人类历史启发的文明基准指标,在智能体模拟环境中评估其表现。这些在《我的世界》环境中运行的模拟表明,智能体能够实现有意义的进展——包括自主发展专业化角色、遵守并改变集体规则,以及参与文化与宗教传播。这些初步结果表明,智能体能够达成通向AI文明的重大里程碑,为大规模模拟、智能体组织智能以及AI与人类文明的融合开辟了新途径。