Autonomous AI agents are beginning to operate across organizational boundaries on the open internet -- discovering, transacting with, and delegating to agents owned by other parties without centralized oversight. When agents from different human principals collaborate at scale, the collective becomes opaque: no single human can observe, audit, or govern the emergent behavior. We term this the Logic Monopoly -- the agent society's unchecked monopoly over the entire logic chain from planning through execution to evaluation. We propose the Separation of Power (SoP) model, a constitutional governance architecture deployed on public blockchain that breaks this monopoly through three structural separations: agents legislate operational rules as smart contracts, deterministic software executes within those contracts, and humans adjudicate through a complete ownership chain binding every agent to a responsible principal. In this architecture, smart contracts are the law itself -- the actual legislative output that agents produce and that governs their behavior. We instantiate SoP in AgentCity on an EVM-compatible layer-2 blockchain (L2) with a three-tier contract hierarchy (foundational, meta, and operational). The core thesis is alignment-through-accountability: if each agent is aligned with its human owner through the accountability chain, then the collective converges on behavior aligned with human intent -- without top-down rules. A pre-registered experiment evaluates this thesis in a commons production economy -- where agents share a finite resource pool and collaboratively produce value -- at 50-1,000 agent scale.
翻译:[摘要]:自主AI智能体正开始在开放互联网上跨组织边界运作——它们在无需集中监管的情况下,发现、交易并委托其他方拥有的智能体。当来自不同人类主体的智能体大规模协作时,其集体行为变得不透明:没有任何单个个体能观察、审计或治理这种涌现行为。我们将其称为逻辑垄断——智能体社会对从规划、执行到评估的整个逻辑链拥有不受制约的垄断权。为此,我们提出权力分立(SoP)模型,这是一种部署在公共区块链上的宪制治理架构,通过三种结构性分立打破这种垄断:智能体将运行规则立法为智能合约,确定性软件在合约框架内执行,人类则通过完整的归属链(将每个智能体绑定至负责任的主体)进行裁决。在该架构中,智能合约即是法律本身——由智能体制定并约束其行为的实际立法输出。我们在兼容EVM的L2区块链上实例化AgentCity实现SoP,构建包含三层合约层级(基础层、元层和运行层)的体系。核心论点是"通过问责实现对齐":若每个智能体通过问责链与其人类所有者对齐,则集体行为将收敛于符合人类意图的模式——无需自上而下的规则。一项预注册实验在50-1000智能体规模下,于公有生产经济(智能体共享有限资源池并协作创造价值)中验证了这一论点。