Inspired by the dual-process theory of human cognition, we introduce DUMA, a novel conversational agent framework that embodies a dual-mind mechanism through the utilization of two generative Large Language Models (LLMs) dedicated to fast and slow thinking respectively. The fast thinking model serves as the primary interface for external interactions and initial response generation, evaluating the necessity for engaging the slow thinking model based on the complexity of the complete response. When invoked, the slow thinking model takes over the conversation, engaging in meticulous planning, reasoning, and tool utilization to provide a well-analyzed response. This dual-mind configuration allows for a seamless transition between intuitive responses and deliberate problem-solving processes based on the situation. We have constructed a conversational agent to handle online inquiries in the real estate industry. The experiment proves that our method balances effectiveness and efficiency, and has a significant improvement compared to the baseline.
翻译:受人类认知双过程理论的启发,我们提出DUMA——一种新颖的对话代理框架,通过利用两个分别专攻快、慢思考的生成式大型语言模型(LLMs)来体现双智机制。快速思考模型作为外部交互和初始响应生成的主要接口,根据完整响应的复杂度评估是否需要调用慢速思考模型。一旦被激活,慢速思考模型将接管对话,通过细致的规划、推理与工具使用来提供经过深度分析的回应。这种双智配置允许根据情境在直觉式响应与深思熟虑的问题解决流程之间实现无缝切换。我们构建了一个处理房地产行业在线咨询的对话代理。实验证明,该方法在有效性与效率之间实现了平衡,且相较于基线方法具有显著提升。