Conventional survival analysis methods are typically ineffective to characterize heterogeneity in the population while such information can be used to assist predictive modeling. In this study, we propose a hybrid survival analysis method, referred to as deep clustering survival machines, that combines the discriminative and generative mechanisms. Similar to the mixture models, we assume that the timing information of survival data is generatively described by a mixture of certain numbers of parametric distributions, i.e., expert distributions. We learn weights of the expert distributions for individual instances according to their features discriminatively such that each instance's survival information can be characterized by a weighted combination of the learned constant expert distributions. This method also facilitates interpretable subgrouping/clustering of all instances according to their associated expert distributions. Extensive experiments on both real and synthetic datasets have demonstrated that the method is capable of obtaining promising clustering results and competitive time-to-event predicting performance.
翻译:传统的生存分析方法通常难以有效刻画人群中的异质性,而此类信息可用于辅助预测建模。本研究提出一种混合生存分析方法——深度聚类生存机器,它融合了判别式与生成式机制。类似于混合模型,我们假设生存数据的时间信息由若干参数化分布(即专家分布)的混合模型生成式刻画。根据个体特征以判别式方式学习各专家分布的权重,从而每个个体的生存信息可由已习得的恒定专家分布的加权组合来表征。该方法还能根据个体关联的专家分布实现所有实例的可解释子群划分/聚类。在真实与合成数据集上的大量实验表明,该方法既能获得有前景的聚类结果,又能实现具有竞争力的时间至事件预测性能。