From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

How does language inform our downstream thinking? In particular, how do humans make meaning from language -- and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose \textit{rational meaning construction}, a computational framework for language-informed thinking that combines neural models of language with probabilistic models for rational inference. We frame linguistic meaning as a context-sensitive mapping from natural language into a \textit{probabilistic language of thought} (PLoT) -- a general-purpose symbolic substrate for probabilistic, generative world modeling. Our architecture integrates two powerful computational tools that have not previously come together: we model thinking with \textit{probabilistic programs}, an expressive representation for flexible commonsense reasoning; and we model meaning construction with \textit{large language models} (LLMs), which support broad-coverage translation from natural language utterances to code expressions in a probabilistic programming language. We illustrate our framework in action through examples covering four core domains from cognitive science: probabilistic reasoning, logical and relational reasoning, visual and physical reasoning, and social reasoning about agents and their plans. In each, we show that LLMs can generate context-sensitive translations that capture pragmatically-appropriate linguistic meanings, while Bayesian inference with the generated programs supports coherent and robust commonsense reasoning. We extend our framework to integrate cognitively-motivated symbolic modules to provide a unified commonsense thinking interface from language. Finally, we explore how language can drive the construction of world models themselves.
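As a minimal sketch of the pipeline the abstract describes, assume a toy world in which two players have latent strengths and matches are noisy. Plain Python with rejection sampling stands in for a probabilistic programming language, and a hand-written lookup table stands in for the LLM translation step. The names `sample_world`, `beats`, `TRANSLATIONS`, and `infer`, the example utterances, and all numeric parameters are assumptions made for illustration; none of them come from the paper.

```python
import random


def sample_world(rng):
    """Generative world model: each player has a latent strength (assumed prior)."""
    return {"alice": rng.gauss(50, 20), "bob": rng.gauss(50, 20)}


def beats(world, a, b, rng):
    """Noisy match outcome: a wins if a's strength plus noise exceeds b's."""
    return world[a] + rng.gauss(0, 10) > world[b] + rng.gauss(0, 10)


# Hypothetical stand-in for the LLM: each utterance maps to a code expression
# over a sampled world, tagged as an observation to condition on or a query.
TRANSLATIONS = {
    "Alice beat Bob.": ("condition", lambda w, rng: beats(w, "alice", "bob", rng)),
    "Is Alice stronger than Bob?": ("query", lambda w, rng: w["alice"] > w["bob"]),
}


def infer(utterances, n_samples=20000, seed=0):
    """Estimate P(query | conditions) by rejection sampling over worlds."""
    rng = random.Random(seed)
    conditions, query = [], None
    for utterance in utterances:
        kind, expr = TRANSLATIONS[utterance]
        if kind == "condition":
            conditions.append(expr)
        else:
            query = expr
    if query is None:
        raise ValueError("at least one utterance must translate to a query")

    accepted = hits = 0
    for _ in range(n_samples):
        world = sample_world(rng)
        # Keep only the worlds that are consistent with every observation.
        if all(cond(world, rng) for cond in conditions):
            accepted += 1
            hits += bool(query(world, rng))
    if accepted == 0:
        raise ValueError("no sampled world satisfied the conditions")
    return hits / accepted


if __name__ == "__main__":
    prior = infer(["Is Alice stronger than Bob?"])
    posterior = infer(["Alice beat Bob.", "Is Alice stronger than Bob?"])
    print(f"prior: {prior:.2f}, posterior: {posterior:.2f}")
```

Conditioning on the observation should raise the estimated probability that Alice is the stronger player above the prior of roughly one half. In the framework itself, the translation table would be replaced by an LLM producing such expressions in context, and rejection sampling by whatever inference the probabilistic programming language provides.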

