Mapping Human Anti-collusion Mechanisms to Multi-agent AI Systems

As multi-agent AI systems become increasingly autonomous, evidence shows they can develop collusive strategies similar to those long observed in human markets and institutions. While human domains have accumulated centuries of anti-collusion mechanisms, it remains unclear how these can be adapted to AI settings. This paper addresses that gap by (i) developing a taxonomy of human anti-collusion mechanisms, including sanctions, leniency & whistleblowing, monitoring & auditing, market design, and governance, and (ii) mapping them to potential interventions for multi-agent AI systems. For each mechanism, we propose implementation approaches. We also highlight open challenges, such as the attribution problem (difficulty attributing emergent coordination to specific agents), identity fluidity (agents being easily forked or modified), the boundary problem (distinguishing beneficial cooperation from harmful collusion), and adversarial adaptation (agents learning to evade detection).


