The development of large language models (LLMs) has brought unprecedented possibilities for artificial intelligence (AI) based medical diagnosis. However, the application perspective of LLMs in real diagnostic scenarios is still unclear because they are not adept at collecting patient data proactively. This study presents a LLM-based diagnostic system that enhances planning capabilities by emulating doctors. Our system involves two external planners to handle planning tasks. The first planner employs a reinforcement learning approach to formulate disease screening questions and conduct initial diagnoses. The second planner uses LLMs to parse medical guidelines and conduct differential diagnoses. By utilizing real patient electronic medical record data, we constructed simulated dialogues between virtual patients and doctors and evaluated the diagnostic abilities of our system. We demonstrated that our system obtained impressive performance in both disease screening and differential diagnoses tasks. This research represents a step towards more seamlessly integrating AI into clinical settings, potentially enhancing the accuracy and accessibility of medical diagnostics.
翻译:大语言模型(LLMs)的发展为基于人工智能(AI)的医疗诊断带来了前所未有的可能性。然而,LLMs在实际诊断场景中的应用前景仍不明朗,因为它们不擅长主动收集患者数据。本研究提出一种基于LLM的诊断系统,通过模拟医生来增强规划能力。我们的系统引入两个外部规划器处理规划任务:第一个规划器采用强化学习方法制定疾病筛查问题并进行初步诊断;第二个规划器利用LLMs解析医疗指南并实施鉴别诊断。通过使用真实患者的电子病历数据,我们构建了虚拟患者与医生之间的模拟对话,并对系统的诊断能力进行了评估。实验表明,我们的系统在疾病筛查与鉴别诊断任务中均取得了令人瞩目的性能。本研究标志着向临床环境中更无缝整合AI迈进一步,有望提升医疗诊断的准确性和可及性。