Conventional survival analysis methods are typically ineffective to characterize heterogeneity in the population while such information can be used to assist predictive modeling. In this study, we propose a hybrid survival analysis method, referred to as deep clustering survival machines, that combines the discriminative and generative mechanisms. Similar to the mixture models, we assume that the timing information of survival data is generatively described by a mixture of certain numbers of parametric distributions, i.e., expert distributions. We learn weights of the expert distributions for individual instances according to their features discriminatively such that each instance's survival information can be characterized by a weighted combination of the learned constant expert distributions. This method also facilitates interpretable subgrouping/clustering of all instances according to their associated expert distributions. Extensive experiments on both real and synthetic datasets have demonstrated that the method is capable of obtaining promising clustering results and competitive time-to-event predicting performance.
翻译:传统生存分析方法通常难以刻画人群异质性,而此类信息可用于辅助预测建模。本研究提出一种融合判别与生成机制的混合生存分析方法,称为深度聚类生存机模型。与混合模型类似,我们假设生存数据的时间信息由若干参数分布(即专家分布)的混合生成式描述。通过判别式学习各实例对应的专家分布权重,使得每个实例的生存信息可由学习到的恒定专家分布的加权组合表征。该方法还能根据实例关联的专家分布,实现对全体实例的可解释性子群划分/聚类。在真实与合成数据集上的大量实验表明,该方法能够获得具有前景的聚类结果,同时具备具有竞争力的时间-事件预测性能。