Bayesian analysis of biomarker levels can predict time of recurrence of prostate cancer with strictly positive apparent Shannon information against an exponential attrition prior

2024 年 4 月 27 日

翻译：基于生物标志物水平的贝叶斯分析可在指数衰减先验下以严格正的表观香农信息预测前列腺癌复发时间

Roger Sewell,Elisabeth Crowe,Sharokh F. Shariat

from arxiv, 16 pages, 8 figures

Shariat et al previously investigated the possibility of predicting, from preoperative biomarkers and clinical data, which of any pair of patients would suffer recurrence of prostate cancer first. We wished to establish the extent to which predictions of time of relapse from such a model could be improved upon using Bayesian methodology. The same dataset was reanalysed using a Bayesian skew-Student mixture model. Predictions were made of which of any pair of patients would relapse first and of the time of relapse. The benefit of using these biomarkers relative to predictions made without them, was measured by the apparent Shannon information, using as prior a simple exponential attrition model of relapse time independent of input variables. Using half the dataset for training and the other half for testing, predictions of relapse time from the strict Cox model gave $-\infty$ nepers of apparent Shannon information, (it predicts that relapse can only occur at times when patients in the training set relapsed). Deliberately smoothed predictions from the Cox model gave -0.001 (-0.131 to +0.120) nepers, while the Bayesian model gave +0.109 (+0.021 to +0.192) nepers (mean, 2.5 to 97.5 centiles), being positive with posterior probability 0.993 and beating the blurred Cox model with posterior probability 0.927. These predictions from the Bayesian model thus outperform those of the Cox model, but the overall yield of predictive information leaves scope for improvement of the range of biomarkers in use. The Bayesian model presented here is the first such model for prostate cancer to consider the variation of relapse hazard with biomarker concentrations to be smooth, as is intuitive. It is also the first model to be shown to provide more apparent Shannon information than the Cox model and the first to be shown to provide positive apparent information relative to an exponential prior.

翻译：Shariat等人先前研究了通过术前生物标志物和临床数据预测任意一对患者中哪一位会先发生前列腺癌复发的可能性。我们旨在确定使用贝叶斯方法在多大程度上能改善此类模型对复发时间的预测效果。采用贝叶斯偏斜学生混合模型对同一数据集重新进行分析，预测了任意一对患者中谁将先复发以及具体的复发时间。以与输入变量无关的简单指数衰减复发时间模型作为先验，通过表观香农信息衡量使用这些生物标志物相较于不使用时的获益程度。使用一半数据集进行训练、另一半进行测试时，严格Cox模型对复发时间的预测产生了-∞奈特的表观香农信息（该模型预测复发仅可能发生在训练集患者复发的对应时间点）。经刻意平滑处理的Cox模型预测结果为-0.001（-0.131至+0.120）奈特，而贝叶斯模型预测结果为+0.109（+0.021至+0.192）奈特（均值及2.5至97.5百分位数），其后验概率为0.993时呈正值，并以0.927的后验概率优于模糊Cox模型。因此，贝叶斯模型的预测性能优于Cox模型，但预测信息的整体产出仍为现有生物标志物范围的改进留有余地。本文提出的贝叶斯模型是首个考虑复发风险随生物标志物浓度平滑变化（符合直觉）的前列腺癌模型，也是首个被证明比Cox模型提供更多表观香农信息、且相对于指数先验提供正表观信息的模型。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Nat. Biotechnol. | 机器学习为生物库驱动的药物发现提供动力

专知会员服务

11+阅读 · 2022年9月12日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日