From Word Models to World Models: Translating from Natural Language to the Probabilistic Language of Thought

How does language inform our downstream thinking? In particular, how do humans make meaning from language--and how can we leverage a theory of linguistic meaning to build machines that think in more human-like ways? In this paper, we propose rational meaning construction, a computational framework for language-informed thinking that combines neural language models with probabilistic models for rational inference. We frame linguistic meaning as a context-sensitive mapping from natural language into a probabilistic language of thought (PLoT)--a general-purpose symbolic substrate for generative world modeling. Our architecture integrates two computational tools that have not previously come together: we model thinking with probabilistic programs, an expressive representation for commonsense reasoning; and we model meaning construction with large language models (LLMs), which support broad-coverage translation from natural language utterances to code expressions in a probabilistic programming language. We illustrate our framework through examples covering four core domains from cognitive science: probabilistic reasoning, logical and relational reasoning, visual and physical reasoning, and social reasoning. In each, we show that LLMs can generate context-sensitive translations that capture pragmatically-appropriate linguistic meanings, while Bayesian inference with the generated programs supports coherent and robust commonsense reasoning. We extend our framework to integrate cognitively-motivated symbolic modules (physics simulators, graphics engines, and planning algorithms) to provide a unified commonsense thinking interface from language. Finally, we explore how language can drive the construction of world models themselves. We hope this work will provide a roadmap towards cognitive models and AI systems that synthesize the insights of both modern and classical computational perspectives.

翻译：语言如何影响我们的深层思考？具体而言，人类如何从语言中构建意义——我们又该如何利用语言学意义理论，以更接近人类的方式构建具有思考能力的机器？本文提出了一种理性意义构建的计算框架，该框架融合了神经语言模型与用于理性推断的概率模型，旨在实现语言驱动的思维过程。我们将语言意义定义为从自然语言到概率思维语言（PLoT）的语境敏感映射——PLoT是一种用于生成式世界建模的通用符号基质。本文架构整合了两种此前未曾结合的计算工具：我们利用概率程序（一种用于常识推理的富有表达力的表征方式）对思考进行建模，并借助大型语言模型（LLM）实现意义构建——LLM能够支持从自然语言语句到概率编程语言中代码表达式的广泛覆盖翻译。我们通过涵盖认知科学四大核心领域的示例来阐释本框架：概率推理、逻辑与关系推理、视觉与物理推理，以及社会推理。在每个领域，我们证明LLM能够生成捕捉实用恰当语言意义的语境敏感翻译，同时，基于生成程序的贝叶斯推理可支持连贯且稳健的常识推理。我们将框架扩展至整合具有认知动机的符号模块（物理模拟器、图形引擎和规划算法），从而提供从语言出发的统一常识思考接口。最后，我们探讨语言如何驱动世界模型本身的构建。希望这项工作能为融合现代与经典计算视角的认知模型及人工智能系统提供路线图。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日