Large Language Models (LLMs) have achieved remarkable success in reasoning tasks with the development of prompting methods. However, existing prompting approaches cannot reuse insights of solving similar problems and suffer from accumulated errors in multi-step reasoning, since they prompt LLMs to reason \textit{from scratch}. To address these issues, we propose \textbf{\textit{Thought Propagation} (TP)}, which explores the analogous problems and leverages their solutions to enhance the complex reasoning ability of LLMs. These analogous problems are related to the input one, with reusable solutions and problem-solving strategies. Thus, it is promising to propagate insights of solving previous analogous problems to inspire new problem-solving. To achieve this, TP first prompts LLMs to propose and solve a set of analogous problems that are related to the input one. Then, TP reuses the results of analogous problems to directly yield a new solution or derive a knowledge-intensive plan for execution to amend the initial solution obtained from scratch. TP is compatible with existing prompting approaches, allowing plug-and-play generalization and enhancement in a wide range of tasks without much labor in task-specific prompt engineering. Experiments across three challenging tasks demonstrate TP enjoys a substantial improvement over the baselines by an average of 12\% absolute increase in finding the optimal solutions in Shortest-path Reasoning, 13\% improvement of human preference in Creative Writing, and 15\% enhancement in the task completion rate of LLM-Agent Planning.
翻译:大语言模型(LLMs)随着提示方法的发展在推理任务中取得了显著成功。然而,现有提示方法无法重复利用解决相似问题的经验,且在多步推理中因从零开始推理而累积错误。为解决这些问题,我们提出**思路传播(Thought Propagation,TP)**,该方法探索类比问题并利用其解决方案来增强LLMs的复杂推理能力。这些类比问题与输入问题相关,具有可复用的解决方案和解题策略。因此,传播先前类比问题的解决思路以启发新问题的求解具有可行性。为实现此目标,TP首先提示LLMs提出并解决一组与输入问题相关的类比问题,随后复用类比问题的结果,直接生成新解决方案或推导出知识密集型执行计划,以修正从零开始获得的初始解决方案。TP与现有提示方法兼容,支持即插即用的泛化与增强,可在广泛任务中无需大量任务特定提示工程。三个挑战性任务的实验表明,TP相比基线方法取得显著提升:最短路径推理中寻找最优解的平均绝对提升达12%,创意写作中人类偏好提升13%,LLM智能体规划的任务完成率提升15%。