A multi-agent reinforcement learning model of reputation and cooperation in human groups

Kevin R. McKee,Edward Hughes,Tina O. Zhu,Martin J. Chadwick,Raphael Koster,Antonio Garcia Castaneda,Charlie Beattie,Thore Graepel,Matt Botvinick,Joel Z. Leibo

Collective action demands that individuals efficiently coordinate how much, where, and when to cooperate. Laboratory experiments have extensively explored the first part of this process, demonstrating that a variety of social-cognitive mechanisms influence how much individuals choose to invest in group efforts. However, experimental research has been unable to shed light on how social cognitive mechanisms contribute to the where and when of collective action. We build and test a computational model of human behavior in Clean Up, a social dilemma task popular in multi-agent reinforcement learning research. We show that human groups effectively cooperate in Clean Up when they can identify group members and track reputations over time, but fail to organize under conditions of anonymity. A multi-agent reinforcement learning model of reputation demonstrates the same difference in cooperation under conditions of identifiability and anonymity. In addition, the model accurately predicts spatial and temporal patterns of group behavior: in this public goods dilemma, the intrinsic motivation for reputation catalyzes the development of a non-territorial, turn-taking strategy to coordinate collective action.

翻译：集体行动要求个体高效协调合作的数量、地点与时机。实验室实验已深入探究该过程的前半部分，证明多种社会认知机制会影响个体在集体努力中投入的资源数量。然而，实验研究未能阐明社会认知机制如何促进集体行动中地点与时机的协调。我们构建并测试了人类在"清洁任务"（Clean Up）这一多智能体强化学习研究领域常见的社会困境任务中的行为计算模型。研究表明，当人类群体能够识别成员身份并随时间追踪声誉时，他们能在此任务中有效合作；但匿名条件下则无法组织起来。基于声誉的多智能体强化学习模型在可识别与匿名条件下呈现出相同的合作差异。此外，该模型准确预测了群体行为的时空模式：在此公共品困境中，声誉的内在动机催化了非领地性轮换策略的形成，以协调集体行动。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

116+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日