TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration

Large language models (LLMs) have demonstrated exceptional performance in planning the use of various functional tools, such as calculators and retrievers, particularly in question-answering tasks. In this paper, we expand the definition of these tools, centering on conceptual tools within the context of dialogue systems. A conceptual tool specifies a cognitive concept that aids systematic or investigative thought. These conceptual tools play important roles in practice, such as multiple psychological or tutoring strategies being dynamically applied in a single turn to compose helpful responses. To further enhance the reasoning and planning capability of LLMs with these conceptual tools, we introduce a multi-persona collaboration framework: Think-Plan-Execute (TPE). This framework decouples the response generation process into three distinct roles: Thinker, Planner, and Executor. Specifically, the Thinker analyzes the internal status exhibited in the dialogue context, such as user emotions and preferences, to formulate a global guideline. The Planner then generates executable plans to call different conceptual tools (e.g., sources or strategies), while the Executor compiles all intermediate results into a coherent response. This structured approach not only enhances the explainability and controllability of responses but also reduces token redundancy. We demonstrate the effectiveness of TPE across various dialogue response generation tasks, including multi-source (FoCus) and multi-strategy interactions (CIMA and PsyQA). This reveals its potential to handle real-world dialogue interactions that require more complicated tool learning beyond just functional tools. The full code and data will be released for reproduction.

翻译：大型语言模型在规划使用计算器、检索器等多种功能性工具方面展现出卓越性能，尤其在问答任务中表现突出。本文拓展了工具的定义范畴，聚焦对话系统中的概念性工具。概念性工具指代辅助系统性或探究性思维的认知概念，这些工具在实践中具有重要作用——例如在同一轮对话中动态应用多种心理学或教学策略以构建有益回应。为进一步提升语言模型使用概念工具进行推理和规划的能力，我们提出多角色协作框架：思考-规划-执行（Think-Plan-Execute, TPE）。该框架将响应生成过程解耦为三个独立角色：思考者、规划者和执行者。具体而言，思考者通过分析对话语境中的内在状态（如用户情绪与偏好）制定全局准则；规划者据此生成可执行方案以调用不同概念工具（如信息源或策略）；执行者则将中间结果整合为连贯响应。这种结构化方法不仅增强了响应的可解释性与可控性，还降低了令牌冗余。我们在多源交互（FoCus）与多策略交互（CIMA和PsyQA）等对话响应生成任务中验证了TPE的有效性，揭示了该方法在需要超越功能性工具的复合工具学习的真实对话交互中的应用潜力。完整代码与数据将公开发布以供复现。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日