Emergent communication and learning pressures in language models: a language evolution perspective

Language models and humans are two types of learning systems. Finding or facilitating commonalities could enable major breakthroughs in our understanding of the acquisition and evolution of language. Many theories of language evolution rely heavily on learning biases and learning pressures. Yet due to substantial differences in learning pressures, it is questionable whether the similarity between humans and machines is sufficient for insights to carry over and to be worth testing with human participants. Here, we review the emergent communication literature, a subfield of multi-agent reinforcement learning, from a language evolution perspective. We find that the emergent communication literature excels at designing and adapting models to recover initially absent linguistic phenomena of natural languages. Based on a short literature review, we identify key pressures that have recovered initially absent human patterns in emergent communication models: communicative success, efficiency, learnability, and other psycho-/sociolinguistic factors. We argue that this may serve as inspiration for how to design language models for language acquisition and language evolution research.

翻译：语言模型和人类是两种学习系统。寻找或促进两者的共通之处，可能为理解语言习得与演化带来重大突破。许多语言演化理论高度依赖于学习偏差与学习压力。然而，由于学习压力存在显著差异，人机相似性是否足以使相关洞见具有可迁移性并值得通过人类被试进行检验，仍存疑问。本文从语言演化视角综述了涌现沟通文献——多智能体强化学习的一个子领域。我们发现，涌现沟通文献擅长设计与调整模型，以恢复自然语言中原本缺失的语言现象。基于简短的文献综述，我们识别出在涌现沟通模型中已成功恢复人类初始缺失模式的关键压力因素：沟通成功、效率、可学习性及其他心理/社会语言因素。我们主张，这可为如何设计适用于语言习得与语言演化研究的语言模型提供启发。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2021】认知启发的时序知识图谱两阶段推理模型

专知会员服务

46+阅读 · 2021年8月6日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日