Conventional survival analysis methods are typically ineffective to characterize heterogeneity in the population while such information can be used to assist predictive modeling. In this study, we propose a hybrid survival analysis method, referred to as deep clustering survival machines, that combines the discriminative and generative mechanisms. Similar to the mixture models, we assume that the timing information of survival data is generatively described by a mixture of certain numbers of parametric distributions, i.e., expert distributions. We learn weights of the expert distributions for individual instances according to their features discriminatively such that each instance's survival information can be characterized by a weighted combination of the learned constant expert distributions. This method also facilitates interpretable subgrouping/clustering of all instances according to their associated expert distributions. Extensive experiments on both real and synthetic datasets have demonstrated that the method is capable of obtaining promising clustering results and competitive time-to-event predicting performance.
翻译:传统的生存分析方法通常难以有效刻画人群异质性,而此类信息可用于辅助预测建模。本研究提出一种混合生存分析方法——深度聚类生存机器,它融合了判别式与生成式机制。与混合模型类似,我们假设生存数据的时间信息是由若干参数化分布(即专家分布)的混合模型生成性地描述的。我们根据每个实例的特征判别性地学习其对应的专家分布权重,使得每个实例的生存信息可通过已学习到的恒定专家分布的加权组合来刻画。该方法还能根据实例关联的专家分布,实现对所有实例的可解释性子组划分/聚类。在真实数据集与合成数据集上的大量实验表明,该方法既能获得有前景的聚类结果,又具有竞争力的时间事件预测性能。