Eliciting cooperation in multi-agent LLM systems is critical for AI alignment. We investigate two approaches: direct communication and curriculum learning. In a 4-player Stag Hunt, a one-word "cheap talk" channel increases cooperation from 0% to 48.3%, establishing communication as a robust coordination mechanism. In contrast, curriculum learning proves highly sensitive to design choices: a pedagogical curriculum of progressively complex games reduced agent payoffs by 27.4% in an Iterated Public Goods Game with Punishment, showing that optimizing for short-term rationality can actively undermine alignment goals. Qualitative analysis reveals that curricula emphasizing defection-equilibrium games can induce "learned pessimism" in agents. These findings suggest that, for coordination problems, simple communication protocols may be more reliable than experience-based training, and that curriculum design for social dilemmas requires careful attention to the strategic lessons embedded in game sequences.
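The 4-player Stag Hunt with a one-word cheap-talk round can be sketched minimally as below. The payoff values and the `trusting` policy are illustrative assumptions, not the paper's actual parameters or agent behavior.

```python
# Minimal sketch of a 4-player Stag Hunt with a one-word cheap-talk
# round. Payoffs are ASSUMED for illustration (stag pays only if all
# four hunt stag; hare is a safe individual option); the paper's exact
# payoff matrix may differ.

STAG_ALL, STAG_FAIL, HARE = 4, 0, 2

def payoffs(actions):
    """actions: list of 'stag' or 'hare', one entry per player."""
    all_stag = all(a == "stag" for a in actions)
    return [
        STAG_ALL if (a == "stag" and all_stag)
        else STAG_FAIL if a == "stag"
        else HARE
        for a in actions
    ]

def play_round(messages, policies):
    """One round: every agent sees all broadcast messages, then acts.
    policies: list of functions mapping the message list to an action."""
    actions = [policy(messages) for policy in policies]
    return actions, payoffs(actions)

# A hypothetical policy: hunt stag only if every message was 'stag'.
trusting = lambda msgs: "stag" if all(m == "stag" for m in msgs) else "hare"

# Unanimous 'stag' talk coordinates everyone on the payoff-dominant outcome.
actions, rewards = play_round(["stag"] * 4, [trusting] * 4)
```

Under these assumed payoffs, a single "hare" message is enough to push trusting agents back to the risk-dominant hare equilibrium, which is the coordination failure that cheap talk helps avoid.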