MoT: Memory-of-Thought Enables ChatGPT to Self-Improve

Large Language Models (LLMs) have shown impressive abilities in various tasks. However, fundamentally improving them depends on high-quality datasets or computationally expensive fine-tuning. On the contrary, humans can easily improve themselves by self-thinking and memory, without external resources. In this paper, we propose a framework, MoT, to let the LLM self-improve through Memory-of-Thought, without annotated datasets and parameter updates. Specifically, MoT is divided into two stages: 1. before the test stage, the LLM pre-thinks on the unlabeled dataset and saves the high-confidence thoughts as external memory; 2. During the test stage, given a test question, the LLM recalls relevant memory to help itself reason and answer it. Experimental results show that MoT can help ChatGPT significantly improve its abilities in arithmetic reasoning, commonsense reasoning, factual reasoning, and natural language inference. Further analyses show that each component contributes critically to the improvements and MoT can lead to consistent improvements across various CoT methods and LLMs.

翻译：大规模语言模型（LLMs）在各种任务中表现出令人印象深刻的能力。然而，从根本上改进它们依赖于高质量数据集或计算代价高昂的微调。相比之下，人类可以通过自我思考和记忆轻松实现自我改进，无需借助外部资源。本文提出一种名为MoT的框架，通过记忆思维（Memory-of-Thought）让LLM实现自我改进，无需标注数据集和参数更新。具体而言，MoT分为两个阶段：1）在测试阶段之前，LLM在无标签数据集上预思考，并将高置信度的思考结果保存为外部记忆；2）在测试阶段，给定测试问题时，LLM回忆相关记忆以辅助自身推理和作答。实验结果表明，MoT能帮助ChatGPT在算术推理、常识推理、事实推理和自然语言推断等多个任务中显著提升能力。进一步分析表明，每个组件对改进均有关键贡献，且MoT可在多种CoT方法和LLM上实现一致的性能提升。

相关内容

ChatGPT

关注 258

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日