This work proposes a novel methodology for measuring compositional behavior in contemporary language embedding models. Specifically, we focus on adjectival modification phenomena in adjective-noun phrases. In recent years, distributional language representation models have demonstrated great practical success. At the same time, the need for interpretability has raised questions about their intrinsic properties and capabilities. Crucially, distributional models are often inconsistent when dealing with compositional phenomena in natural language, which has significant implications for their safety and fairness. Despite this, most current research on compositionality is directed only towards improving performance on similarity tasks. This work takes a different approach, introducing three novel tests of compositional behavior inspired by Montague semantics. Our experimental results indicate that current neural language models do not behave as the relevant linguistic theories predict. This suggests either that current language models lack the capability to capture, from limited context, the semantic properties we evaluate, or that linguistic theories in the Montagovian tradition do not match the expected capabilities of distributional models.
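To make the general idea concrete, the following is a minimal sketch, not the paper's actual tests, of one way a compositionality check on adjective-noun phrase embeddings could look. The toy vectors and the specific criterion (an intersectively modified phrase should stay closer to its head noun than to an unrelated noun) are illustrative assumptions, standing in for a real embedding model's output.

```python
import numpy as np

def cosine(u: np.ndarray, v: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Hypothetical toy embeddings standing in for a real model's output.
emb = {
    "car":     np.array([0.9, 0.1, 0.0]),
    "red car": np.array([0.8, 0.3, 0.1]),
    "banana":  np.array([0.0, 0.2, 0.9]),
}

# One intuition such a test could probe: an intersective modifier ("red")
# should keep the phrase embedding close to its head noun ("car") and
# farther from an unrelated noun ("banana").
head_sim = cosine(emb["red car"], emb["car"])
distractor_sim = cosine(emb["red car"], emb["banana"])
print(head_sim > distractor_sim)  # the behavior a compositional model would show
```

With a real model, `emb` would be populated by encoding the phrases, and the test would aggregate such comparisons over many adjective-noun pairs.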