Explainable AI for clinical risk prediction: a survey of concepts, methods, and modalities

Recent advancements in AI applications to healthcare have shown incredible promise in surpassing human performance in diagnosis and disease prognosis. With the increasing complexity of AI models, however, concerns regarding their opacity, potential biases, and the need for interpretability. To ensure trust and reliability in AI systems, especially in clinical risk prediction models, explainability becomes crucial. Explainability is usually referred to as an AI system's ability to provide a robust interpretation of its decision-making logic or the decisions themselves to human stakeholders. In clinical risk prediction, other aspects of explainability like fairness, bias, trust, and transparency also represent important concepts beyond just interpretability. In this review, we address the relationship between these concepts as they are often used together or interchangeably. This review also discusses recent progress in developing explainable models for clinical risk prediction, highlighting the importance of quantitative and clinical evaluation and validation across multiple common modalities in clinical practice. It emphasizes the need for external validation and the combination of diverse interpretability methods to enhance trust and fairness. Adopting rigorous testing, such as using synthetic datasets with known generative factors, can further improve the reliability of explainability methods. Open access and code-sharing resources are essential for transparency and reproducibility, enabling the growth and trustworthiness of explainable research. While challenges exist, an end-to-end approach to explainability in clinical risk prediction, incorporating stakeholders from clinicians to developers, is essential for success.

翻译：近年来，人工智能在医疗领域的应用展现出超越人类在诊断和疾病预后方面的惊人潜力。然而，随着AI模型日趋复杂，其黑箱性、潜在偏差及对可解释性的需求引发关注。为确保AI系统（尤其是临床风险预测模型）的可靠性与可信度，可解释性至关重要。可解释性通常指AI系统向人类利益相关者提供其决策逻辑或决策本身稳健解释的能力。在临床风险预测中，公平性、偏差、可信度与透明度等可解释性的其他维度，同样是超越单纯可解释性的重要概念。本综述梳理了这些常被混用或交替使用的概念之间的关联，同时探讨了开发面向临床风险预测的可解释模型的最新进展，强调在临床实践多种常见模态中进行定量评估与临床评估验证的重要性。研究指出，采用外部验证、融合多样化可解释性方法有助于增强可信度与公平性。通过采用已知生成因子的合成数据集等严格测试手段，可进一步提升可解释性方法的可靠性。开放获取与代码共享资源对确保透明度与可重复性至关重要，能够促进可解释性研究的健康发展与可信度提升。尽管存在挑战，但通过整合从临床医生到开发人员等利益相关者的端到端可解释性方法，是临床风险预测领域取得成功的关键。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日