This paper delves into the capabilities of large language models (LLMs), specifically focusing on advancing the theoretical comprehension of chain-of-thought prompting. We investigate how LLMs can be effectively induced to generate a coherent chain of thoughts. To achieve this, we introduce a two-level hierarchical graphical model tailored for natural language generation. Within this framework, we establish a compelling geometrical convergence rate that gauges the likelihood of an LLM-generated chain of thoughts compared to those originating from the true language. Our findings provide a theoretical justification for the ability of LLMs to produce the correct sequence of thoughts (potentially) explaining performance gains in tasks demanding reasoning skills.
翻译:本文深入探讨大型语言模型(LLM)的能力,重点在于推进对思维链提示理论层面的理解。我们研究了如何有效诱导LLM生成连贯的思维链。为此,我们引入了一个专为自然语言生成设计的两级层次化图模型。在该框架下,我们建立了一个令人信服的几何收敛速率,用以衡量LLM生成的思维链相对于真实语言生成思维链的可能性。我们的研究结果为LLM生成正确思维序列的能力提供了理论依据,这(可能)解释了它们在需要推理能力的任务中性能提升的原因。