Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

Pre-trained language models (LMs) are able to perform complex reasoning without explicit fine-tuning. To understand how pre-training with a next-token prediction objective contributes to the emergence of such reasoning capability, we propose that we can view an LM as deriving new conclusions by aggregating indirect reasoning paths seen at pre-training time. We found this perspective effective in two important cases of reasoning: logic reasoning with knowledge graphs (KGs) and math reasoning with math word problems (MWPs). More specifically, we formalize the reasoning paths as random walk paths on the knowledge/reasoning graphs. Analyses of learned LM distributions suggest that a weighted sum of relevant random walk path probabilities is a reasonable way to explain how LMs reason. Experiments and analysis on multiple KG and MWP datasets reveal the effect of training on random walk paths and suggest that augmenting unlabeled random walk reasoning paths can improve real-world multi-step reasoning performance.

翻译：预训练语言模型（LMs）无需显式微调即可执行复杂推理。为理解基于下一词预测目标的大规模预训练如何催生这种推理能力，我们提出可将语言模型视为通过聚合预训练阶段观察到的间接推理路径来推导新结论。该视角在两类重要推理场景中效果显著：基于知识图谱（KGs）的逻辑推理与基于数学应用题（MWPs）的数学推理。具体而言，我们将推理路径形式化为知识/推理图上的随机游走路径。对学习到的语言模型分布的分析表明，相关随机游走路径概率的加权求和是解释LM推理机制的合理方式。在多个知识图谱和数学应用题数据集上的实验与分析揭示了随机游走路径训练的影响，并表明通过增强未标注的随机游走推理路径可提升实际多步推理性能。

相关内容

随机漫步

关注 1

在数学中，随机漫步是一种数学对象，称为随机过程或随机过程，它描述的路径由在某些数学空间（例如整数）上的一系列随机步骤组成。随机行走等是指基于过去的表现，无法预测将来的发展步骤和方向。核心概念是指任何无规则行走者所带的守恒量都各自对应着一个扩散运输定律，接近于布朗运动，是布朗运动理想的数学状态，现阶段主要应用于互联网链接分析及金融股票市场中。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

语言视觉预训练语言模型揭密，Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

专知会员服务

36+阅读 · 2020年5月20日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日