BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

Large language models (LLMs) have catalyzed a paradigm shift in natural language processing, yet their limited controllability remains a significant challenge for downstream applications. We address this by drawing inspiration from the neural mechanisms of the human brain, specifically Broca's and Wernicke's areas, which are crucial for language generation and comprehension, respectively. In particular, Broca's area receives cognitive decision signals from Wernicke's area, so language generation is treated as an intricate decision-making process, in contrast to the fully auto-regressive generation of existing LLMs. In a similar vein, our proposed system, the BWArea model, conceptualizes language generation as a decision-making task. The model has three components: a language world model, an inverse dynamics model, and a cognitive policy. Like Wernicke's area, the inverse dynamics model is designed to deduce the underlying cognitive intention, or latent action, behind each token. The BWArea model is amenable to both pre-training and fine-tuning, like existing LLMs. We trained a BWArea model on 30B clean pre-training tokens; it achieves performance competitive with LLMs of equal size (1B parameters). Unlike fully auto-regressive LLMs, its pre-training performance does not degrade when dirty data unintentionally appears. This shows the advantage of the BWArea model's decomposed structure in reducing the effort spent on laborious data selection and labeling. Finally, we show that the BWArea model offers enhanced controllability: fine-tuning the cognitive policy with downstream reward metrics makes alignment simpler. On 9 out of 10 tasks from two suites, TextWorld and BigBench Hard, our method outperforms auto-regressive LLMs.
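The three-component decomposition can be illustrated with a toy sketch. This is not the authors' implementation: the vocabulary, the two discrete latent actions, the probability tables, and every function name below are hypothetical. In particular, the inverse dynamics model in the paper is a learned network, whereas this sketch substitutes an exact Bayes posterior purely for illustration.

```python
import random

# Hypothetical toy vocabulary and latent-action space (not from the paper).
VOCAB = ["the", "cat", "dog", "sat", "ran"]
NUM_ACTIONS = 2


def policy(context):
    """Toy cognitive policy: distribution over latent actions given the context."""
    # Assumed rule: favor action 0 after an even number of tokens.
    return [0.8, 0.2] if len(context) % 2 == 0 else [0.3, 0.7]


def world_model(context, action):
    """Toy world model: next-token distribution given context and latent action."""
    # Assumed tables: action 0 emits noun-like tokens, action 1 verb-like tokens.
    if action == 0:
        return [0.6, 0.2, 0.2, 0.0, 0.0]
    return [0.0, 0.0, 0.0, 0.5, 0.5]


def inverse_dynamics(context, next_token):
    """Infer the latent action behind an observed token.

    Stand-in for a learned model: exact Bayes posterior from the toy tables.
    """
    idx = VOCAB.index(next_token)
    prior = policy(context)
    joint = [prior[a] * world_model(context, a)[idx] for a in range(NUM_ACTIONS)]
    total = sum(joint)
    if total == 0:
        raise ValueError("token has zero probability under every latent action")
    return [j / total for j in joint]


def next_token_distribution(context):
    """Marginal next-token distribution: sum over actions of policy * world model."""
    prior = policy(context)
    return [
        sum(prior[a] * world_model(context, a)[i] for a in range(NUM_ACTIONS))
        for i in range(len(VOCAB))
    ]


def generate(num_tokens, seed=0):
    """Generate text as a decision process: pick a latent action, then emit a token."""
    rng = random.Random(seed)
    context = []
    for _ in range(num_tokens):
        action = rng.choices(range(NUM_ACTIONS), weights=policy(context))[0]
        token = rng.choices(VOCAB, weights=world_model(context, action))[0]
        context.append(token)
    return context


if __name__ == "__main__":
    print(generate(6))
    print(inverse_dynamics([], "sat"))
```

Under this reading, fine-tuning for controllability would change only `policy`, leaving `world_model` fixed, which is the kind of separation the abstract attributes to the decomposed structure.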

