The fundamental problem of risk prediction for individuals: health AI, uncertainty, and personalized medicine

Background and Objective: Clinical prediction models are commonly evaluated regarding performance for a population, although decisions are made for individuals. The classic view relates uncertainty in risk estimates for individuals to sample size (estimation uncertainty) while other sources are model uncertainty (variability in modeling choices) and applicability uncertainty (variability in measurement procedures and between populations). We aim to illustrate the uncertainty of prediction models in estimating individual risks with an ovarian cancer example. Methods: We used real and synthetic data for ovarian cancer diagnosis to train 59400 models with variations in estimation, model, and applicability uncertainty. We then used these models to estimate the probability of ovarian cancer in a fixed test set of 100 patients and evaluate the variability in individual estimates. Results: We show empirically that estimation uncertainty can be strongly dominated by model uncertainty and applicability uncertainty, even for models that perform well at the population level. Estimation uncertainty decreased considerably with increasing training sample size, whereas model and applicability uncertainty remained large. Conclusion: Individual risk estimates are far more uncertain than often assumed. Model uncertainty and applicability uncertainty usually remain invisible when prediction models or algorithms are based on a single study. Predictive algorithms should inform, not dictate, care and support personalization through clinician-patient interaction.

翻译：背景与目的：临床预测模型通常在群体层面进行评估，但实际决策需针对个体制定。经典观点将个体风险估计的不确定性归因于样本量（估计不确定性），而其他来源包括模型不确定性（建模选择的变异性）与应用性不确定性（测量流程及群体间的变异性）。本研究拟以卵巢癌为例，阐明预测模型在个体风险估计中的不确定性。方法：利用卵巢癌诊断的真实与合成数据，构建59400个涵盖估计、模型及应用性不确定性的模型。将这些模型应用于固定测试集（100例患者）以估计卵巢癌概率，并评估个体估计值的变异性。结果：实证表明，即使群体层面表现优异的模型，估计不确定性仍可能被模型不确定性和应用性不确定性显著主导。估计不确定性随训练样本量增加而显著降低，而模型不确定性与应用性不确定性仍保持较大。结论：个体风险估计的不确定性远超常规认知。当预测模型或算法基于单一研究时，模型不确定性与应用性不确定性通常被掩盖。预测性算法应辅助而非主导诊疗决策，通过医患互动支持个性化医疗。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

专知会员服务

6+阅读 · 6月14日

具身AI安全综述：风险、攻击与防御

专知会员服务

11+阅读 · 5月6日

迈向个性化大语言模型驱动的智能体：基础、评估与未来方向

专知会员服务

28+阅读 · 2月27日

认知优势：人工智能在国家安全决策中的核心作用

专知会员服务

15+阅读 · 2025年8月16日