Laboratory experiments have shown that communication plays an important role in solving social dilemmas. Here, by extending the AI-Economist, a mixed-motive multi-agent reinforcement learning environment, I address the following descriptive question: which governing system facilitates the emergence and evolution of communication and teaching among agents? To answer this question, the AI-Economist is extended with a voting mechanism to simulate three governing systems along the individualistic-collectivistic axis, from Full-Libertarian to Full-Utilitarian. The AI-Economist is further extended to include communication with possible misalignment, a variant of the signalling game, by letting agents build houses together if they are able to name mutually complementary material resources by the same letter. Another extension adds teaching with possible misalignment, again a variant of the signalling game: half of the agents act as teachers, who know how to use mutually complementary material resources to build houses but cannot build houses themselves, and the other half act as students, who lack this information but can build those houses if teachers teach them. I found strong evidence that a collectivistic environment, such as the Full-Utilitarian system, is more favourable to the emergence of communication and teaching, or more precisely, to the evolution of language alignment. Moreover, I found some evidence that the evolution of language alignment through communication and teaching under collectivistic governing systems makes individuals more averse to advantageous inequity. As a result, there is a positive correlation between the evolution of language alignment and equality in the society.
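To make the communication extension concrete, the naming condition can be sketched as a small check on two agents' lexicons. This is only an illustrative sketch, not the implementation used in the extended AI-Economist: the resource names, the letter labels, the helper names `alignment` and `can_build_together`, and the rule that all complementary resources must be named identically are assumptions introduced here.

```python
from typing import Dict, Tuple

# Hypothetical pair of mutually complementary resources (assumed for illustration).
RESOURCES: Tuple[str, ...] = ("wood", "stone")

Lexicon = Dict[str, str]  # maps a resource name to the letter an agent uses for it


def alignment(lexicon_a: Lexicon, lexicon_b: Lexicon) -> float:
    """Fraction of resources that both agents name with the same letter."""
    shared = [r for r in RESOURCES if lexicon_a[r] == lexicon_b[r]]
    return len(shared) / len(RESOURCES)


def can_build_together(lexicon_a: Lexicon, lexicon_b: Lexicon) -> bool:
    """Assumed rule: joint building succeeds only when every
    complementary resource receives the same letter from both agents."""
    return all(lexicon_a[r] == lexicon_b[r] for r in RESOURCES)


# Example lexicons: the agents agree on "wood" but not on "stone".
agent_a: Lexicon = {"wood": "A", "stone": "B"}
agent_b: Lexicon = {"wood": "A", "stone": "C"}

partial = alignment(agent_a, agent_b)
misaligned_build = can_build_together(agent_a, agent_b)
aligned_build = can_build_together(agent_a, agent_a)
```

Under these assumptions, a partially aligned pair cannot build jointly, so any reward from joint building depends on the agents converging on a shared naming of the resources.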