Inspired by the dual-process theory of human cognition, we introduce DUMA, a novel conversational agent framework that embodies a dual-mind mechanism through the utilization of two generative Large Language Models (LLMs) dedicated to fast and slow thinking respectively. The fast thinking model serves as the primary interface for external interactions and initial response generation, evaluating the necessity for engaging the slow thinking model based on the complexity of the complete response. When invoked, the slow thinking model takes over the conversation, engaging in meticulous planning, reasoning, and tool utilization to provide a well-analyzed response. This dual-mind configuration allows for a seamless transition between intuitive responses and deliberate problem-solving processes based on the situation. We have constructed a conversational agent to handle online inquiries in the real estate industry. The experiment proves that our method balances effectiveness and efficiency, and has a significant improvement compared to the baseline.
翻译:受人类认知双过程理论启发,我们提出了DUMA——一种新型对话智能体框架。该框架通过分别专攻快思维与慢思维的两个生成式大语言模型(LLMs)实现双心智机制。快思维模型作为外部交互与初始响应生成的主界面,基于完整响应复杂度评估是否需要调用慢思维模型。当被激活时,慢思维模型接管对话,通过缜密规划、推理及工具调用来提供深度分析后的响应。这种双心智配置使系统能根据情境在直觉响应与审慎问题解决过程间无缝切换。我们构建了面向房地产行业在线咨询的对话智能体,实验证明该方法在保持效果与效率平衡的同时,相较于基线模型实现了显著提升。