With recent advancements in natural language processing, Large Language Models (LLMs) have emerged as powerful tools for various real-world applications. Despite their prowess, the intrinsic generative abilities of LLMs may prove insufficient for handling complex tasks which necessitate a combination of task planning and the usage of external tools. In this paper, we first propose a structured framework tailored for LLM-based AI Agents and discuss the crucial capabilities necessary for tackling intricate problems. Within this framework, we design two distinct types of agents (i.e., one-step agent and sequential agent) to execute the inference process. Subsequently, we instantiate the framework using various LLMs and evaluate their Task Planning and Tool Usage (TPTU) abilities on typical tasks. By highlighting key findings and challenges, our goal is to provide a helpful resource for researchers and practitioners to leverage the power of LLMs in their AI applications. Our study emphasizes the substantial potential of these models, while also identifying areas that need more investigation and improvement.
翻译:随着自然语言处理领域的最新进展,大语言模型已成为多种实际应用场景中的强大工具。尽管其能力显著,但大语言模型固有的生成能力在处理需要任务规划与外部工具使用相结合的复杂任务时可能仍显不足。本文首先提出了一种专为大语言模型驱动的AI代理设计的结构化框架,并探讨了解决复杂问题所必需的关键能力。在该框架内,我们设计了两种不同类型的代理(即单步代理与序列代理)来执行推理过程。随后,我们利用多种大语言模型实例化该框架,并评估其在典型任务中的任务规划与工具使用能力。通过突出关键发现与挑战,本研究旨在为研究人员和从业者提供有价值的参考,以在AI应用中充分利用大语言模型的潜力。同时,我们的研究强调了这些模型的巨大潜力,并指出了需要进一步探索与改进的方向。