Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts

Maciej Besta,Florim Memedi,Zhenyu Zhang,Robert Gerstenberger,Nils Blach,Piotr Nyczyk,Marcin Copik,Grzegorz Kwaśniewski,Jürgen Müller,Lukas Gianinazzi,Ales Kubicek,Hubert Niewiadomski,Onur Mutlu,Torsten Hoefler

The field of natural language processing (NLP) has witnessed significant progress in recent years, with a notable focus on improving large language models' (LLM) performance through innovative prompting techniques. Among these, prompt engineering coupled with structures has emerged as a promising paradigm, with designs such as Chain-of-Thought, Tree of Thoughts, or Graph of Thoughts, in which the overall LLM reasoning is guided by a structure such as a graph. As illustrated with numerous examples, this paradigm significantly enhances the LLM's capability to solve numerous tasks, ranging from logical or mathematical reasoning to planning or creative writing. To facilitate the understanding of this growing field and pave the way for future developments, we devise a general blueprint for effective and efficient LLM reasoning schemes. For this, we conduct an in-depth analysis of the prompt execution pipeline, clarifying and clearly defining different concepts. We then build the first taxonomy of structure-enhanced LLM reasoning schemes. We focus on identifying fundamental classes of harnessed structures, and we analyze the representations of these structures, algorithms executed with these structures, and many others. We refer to these structures as reasoning topologies, because their representation becomes to a degree spatial, as they are contained within the LLM context. Our study compares existing prompting schemes using the proposed taxonomy, discussing how certain design choices lead to different patterns in performance and cost. We also outline theoretical underpinnings, relationships between prompting and others parts of the LLM ecosystem such as knowledge bases, and the associated research challenges. Our work will help to advance future prompt engineering techniques.

翻译：自然语言处理领域近年来取得了显著进展，其中通过创新提示技术提升大型语言模型性能成为关注焦点。在诸多方法中，基于结构化提示的工程范式崭露头角，例如思维链、思维树或思维图等设计，其核心是通过图等结构引导LLM的完整推理过程。大量实例表明，该范式显著增强了LLM在逻辑推理、数学运算、规划编排乃至创意写作等多元任务中的求解能力。为促进该新兴领域的理解并为未来发展铺平道路，我们构建了一套适用于高效LLM推理方案的通用蓝图。为此，我们深入剖析了提示执行流程，厘清并明确定义了相关概念，进而首次建立了结构化增强型LLM推理方案分类体系。研究聚焦于识别所采用结构的基本类别，分析这些结构的表征方式、基于此类结构的算法执行机制及其他关键要素。我们将这些结构称为推理拓扑，因其表征在LLM上下文环境中呈现出一定程度的空间特性。本研究运用所提出的分类体系对现有提示方案进行对比，探讨特定设计选择如何导致性能与成本的不同模式。同时，我们阐述了理论基础、提示与知识库等LLM生态系统其他组成部分的关联，以及相关研究挑战。本工作将推动未来提示工程技术的发展。

相关内容

大语言模型

关注 66

大语言模型是基于海量文本数据训练的深度学习模型。它不仅能够生成自然语言文本，还能够深入理解文本含义，处理各种自然语言任务，如文本摘要、问答、翻译等。2023年，大语言模型及其在人工智能领域的应用已成为全球科技研究的热点，其在规模上的增长尤为引人注目，参数量已从最初的十几亿跃升到如今的一万亿。参数量的提升使得模型能够更加精细地捕捉人类语言微妙之处，更加深入地理解人类语言的复杂性。在过去的一年里，大语言模型在吸纳新知识、分解复杂任务以及图文对齐等多方面都有显著提升。随着技术的不断成熟，它将不断拓展其应用范围，为人类提供更加智能化和个性化的服务，进一步改善人们的生活和生产方式。

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日