Grounding Clinical AI Competency in Human Cognition Through the Clinical World Model and Skill-Mix Framework

Seyed Amir Ahmad Safavi-Naini, Elahe Meftah, Josh Mohess, Pooya Mohammadi Kazaj, Georgios Siontis, Zahra Atf, Peter R. Lewis, Mauricio Reyes, Girish Nadkarni, Roland Wiest, Stephan Windecker, Christoph Grani, Ali Soroush, Isaac Shiri

Source: arXiv. Code, data (Clinical AI Skill-Mix dimension specifications), and an exploratory dashboard are available at https://github.com/Sdamirsa/Clinical-World-Model

The competency of any intelligent agent is bounded by its formal account of the world in which it operates. Clinical AI lacks such an account. Existing frameworks address evaluation, regulation, or system design in isolation, without a shared model of the clinical world to connect them. We introduce the Clinical World Model, a framework that formalizes care as a tripartite interaction among Patient, Provider, and Ecosystem. To formalize how any agent, whether human or artificial, transforms information into clinical action, we develop parallel decision-making architectures for providers, patients, and AI agents, grounded in validated principles of clinical cognition. The Clinical AI Skill-Mix operationalizes competency through eight dimensions. Five define the clinical competency space (condition, phase, care setting, provider role, and task) and three specify how AI engages human reasoning (assigned authority, agent facing, and anchoring layer). The combinatorial product of these dimensions yields a space of billions of distinct competency coordinates. A central structural implication is that validation within one coordinate provides minimal evidence for performance in another, rendering the competency space irreducible. The framework supplies a common grammar through which clinical AI can be specified, evaluated, and bounded across stakeholders. By making this structure explicit, the Clinical World Model reframes the field's central question from whether AI works to in which competency coordinates reliability has been demonstrated, and for whom.
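To make the combinatorial claim concrete, the following is a minimal Python sketch of a competency coordinate and the size of the space it lives in. The eight dimension names come from the abstract; the per-dimension counts, the example values, and the names `CompetencyCoordinate`, `space_size`, and `is_validated` are hypothetical placeholders chosen only for illustration, and are not taken from the paper or its repository.

```python
from dataclasses import dataclass
from math import prod

# Dimension names follow the abstract. Every count below is a hypothetical
# placeholder, NOT a figure from the paper or its dimension specifications.
CLINICAL_DIMENSIONS = {
    "condition": 1000,
    "phase": 5,
    "care_setting": 10,
    "provider_role": 20,
    "task": 50,
}
ENGAGEMENT_DIMENSIONS = {
    "assigned_authority": 4,
    "agent_facing": 3,
    "anchoring_layer": 5,
}


@dataclass(frozen=True)
class CompetencyCoordinate:
    """One point in the eight-dimensional competency space."""
    condition: str
    phase: str
    care_setting: str
    provider_role: str
    task: str
    assigned_authority: str
    agent_facing: str
    anchoring_layer: str


def space_size(*dimension_tables: dict) -> int:
    """Multiply the number of values across all dimensions."""
    return prod(n for table in dimension_tables for n in table.values())


def is_validated(coord: CompetencyCoordinate, validated: set) -> bool:
    """Count evidence only for an exact coordinate match.

    This is a deliberately strict reading of the abstract's statement that
    validation in one coordinate says little about another.
    """
    return coord in validated


# Hypothetical example values for illustration only.
tested = CompetencyCoordinate(
    "heart_failure", "diagnosis", "emergency_department",
    "physician", "triage", "advisory", "provider", "evidence",
)
# Same coordinate except for the care setting.
shifted = CompetencyCoordinate(
    "heart_failure", "diagnosis", "primary_care",
    "physician", "triage", "advisory", "provider", "evidence",
)

total = space_size(CLINICAL_DIMENSIONS, ENGAGEMENT_DIMENSIONS)
validated_coords = {tested}
```

Under these assumed counts the product lands in the billions, and changing a single dimension (here, the care setting) already yields a coordinate for which the sketch records no evidence.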

