How language models extrapolate outside the training data: A case study in Textualized Gridworld

Language models' ability to extrapolate learned behaviors to novel, more complex environments beyond their training scope is highly unknown. This study introduces a path planning task in a textualized Gridworld to probe language models' extrapolation capabilities. We show that conventional approaches, including next token prediction and Chain of Thought (CoT) finetuning, fail to extrapolate in larger, unseen environments. Inspired by human cognition and dual process theory, we propose cognitive maps for path planning, a novel CoT framework that simulates humanlike mental representations. Our experiments show that cognitive maps not only enhance extrapolation to unseen environments but also exhibit humanlike characteristics through structured mental simulation and rapid adaptation. Our finding that these cognitive maps require specialized training schemes and cannot be induced through simple prompting opens up important questions about developing general-purpose cognitive maps in language models. Our comparison with exploration-based methods further illuminates the complementary strengths of offline planning and online exploration.

翻译：语言模型将习得行为泛化至训练范围之外的新颖、更复杂环境的能力尚不明确。本研究通过在文本化网格世界中引入路径规划任务，以探究语言模型的泛化能力。我们发现包括下一词预测和思维链（CoT）微调在内的传统方法，在更大规模的未见环境中均无法实现有效泛化。受人类认知与双过程理论启发，我们提出用于路径规划的认知地图——一种模拟人类心智表征的新型CoT框架。实验表明，认知地图不仅能提升对未见环境的泛化性能，还能通过结构化心智模拟与快速适应机制展现类人认知特征。研究同时发现，此类认知地图需要专门训练方案且无法通过简单提示激发，这为开发语言模型中通用认知地图提出了重要课题。与基于探索的方法对比进一步揭示了离线规划与在线探索能力的互补优势。

相关内容

Cognition

关注 4

Cognition：Cognition：International Journal of Cognitive Science Explanation：认知：国际认知科学杂志。 Publisher：Elsevier。 SIT： http://www.journals.elsevier.com/cognition/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日