Human-AI Co-design for Clinical Prediction Models

Developing safe, effective, and practically useful clinical prediction models (CPMs) traditionally requires iterative collaboration between clinical experts, data scientists, and informaticists. This process refines the often small but critical details of the model building process, such as which features/patients to include and how clinical categories should be defined. However, this traditional collaboration process is extremely time- and resource-intensive, resulting in only a small fraction of CPMs reaching clinical practice. This challenge intensifies when teams attempt to incorporate unstructured clinical notes, which can contain an enormous number of concepts. To address this challenge, we introduce HACHI, an iterative human-in-the-loop framework that uses AI agents to accelerate the development of fully interpretable CPMs by enabling the exploration of concepts in clinical notes. HACHI alternates between (i) an AI agent rapidly exploring and evaluating candidate concepts in clinical notes and (ii) clinical and domain experts providing feedback to improve the CPM learning process. HACHI defines concepts as simple yes-no questions that are used in linear models, allowing the clinical AI team to transparently review, refine, and validate the CPM learned in each round. In two real-world prediction tasks (acute kidney injury and traumatic brain injury), HACHI outperforms existing approaches, surfaces new clinically relevant concepts not included in commonly-used CPMs, and improves model generalizability across clinical sites and time periods. Furthermore, HACHI reveals the critical role of the clinical AI team, such as directing the AI agent to explore concepts that it had not previously considered, adjusting the granularity of concepts it considers, changing the objective function to better align with the clinical objectives, and identifying issues of data bias and leakage.

翻译：开发安全、有效且具有实际应用价值的临床预测模型（CPMs）传统上需要临床专家、数据科学家和信息学专家之间的迭代协作。这一过程旨在完善模型构建过程中那些通常细微但至关重要的细节，例如应纳入哪些特征/患者以及如何定义临床类别。然而，这种传统的协作过程极其耗费时间和资源，导致仅有极小比例的CPMs能够最终应用于临床实践。当团队试图纳入非结构化的临床记录时，这一挑战会进一步加剧，因为其中可能包含海量的临床概念。为应对这一挑战，我们提出了HACHI，一种迭代式人在回路框架，该框架利用AI智能体通过探索临床记录中的概念来加速开发完全可解释的CPMs。HACHI交替进行以下两个步骤：(i) AI智能体快速探索和评估临床记录中的候选概念；(ii) 临床及领域专家提供反馈以改进CPMs的学习过程。HACHI将概念定义为用于线性模型的简单是非问题，这使得临床AI团队能够透明地审查、优化和验证每一轮学习得到的CPM。在两个真实世界的预测任务（急性肾损伤和创伤性脑损伤）中，HACHI的表现优于现有方法，揭示了常用CPMs中未包含的、具有临床相关性的新概念，并提升了模型在不同临床地点和时期之间的泛化能力。此外，HACHI揭示了临床AI团队的关键作用，例如引导AI智能体探索其先前未考虑的概念、调整其考虑概念的粒度、更改目标函数以更好地与临床目标对齐，以及识别数据偏差和数据泄露问题。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

超越生成式人工智能：用于临床预测、反事实推断与规划的世界模型

专知会员服务

22+阅读 · 2025年11月23日

【MIT博士论文】医学人工智能中的自然语言基础模型

专知会员服务

15+阅读 · 2025年4月2日

【新书】大模型与生成式人工智能医学应用，58页pdf

专知会员服务

109+阅读 · 2024年3月19日