While language models are powerful and versatile, they often fail to address highly complex problems. This is because solving complex problems requires deliberate thinking, which has been only minimally guided during training. In this paper, we propose a new method called Cumulative Reasoning (CR), which employs language models in a cumulative and iterative manner to emulate human thought processes. By decomposing tasks into smaller components, \ournameb streamlines the problem-solving process, rendering it both more manageable and effective. For logical inference tasks, CR consistently outperforms existing methods with an improvement up to 9.3\%, and achieves the astonishing accuracy of 98.04\% on the curated FOLIO wiki dataset. In the context of the Game of 24, CR achieves an accuracy of 94\%, which signifies a substantial enhancement of 20\% over the previous state-of-the-art method.
翻译:尽管语言模型功能强大且通用,但在应对高度复杂问题时往往表现不佳。这是因为解决复杂问题需要审慎思考,而这一过程在训练中仅得到极少的指导。本文提出一种名为"累积推理"(Cumulative Reasoning, CR)的新方法,该方法以累积迭代的方式运用语言模型来模拟人类思维过程。通过将任务分解为更小的组成部分,CR简化了问题解决流程,使其既更易管理又更高效。在逻辑推理任务中,CR始终优于现有方法,性能提升最高达9.3%,并在精心整理的FOLIO wiki数据集上实现了98.04%的惊人准确率。在"24点游戏"背景下,CR达到了94%的准确率,相比先前最先进的方法实现了20%的显著提升。