Improving the future of healthcare starts by better understanding the current actual practices in hospital settings. This motivates the objective of discovering typical care pathways from patient data. Revealing typical care pathways can be achieved through clustering. The difficulty in clustering care pathways, represented by sequences of timestamped events, lies in defining a semantically appropriate metric and clustering algorithms. In this article, we adapt two methods developed for time series to the clustering of timed sequences: the drop-DTW metric and the DBA approach for the construction of averaged time sequences. These methods are then applied in clustering algorithms to propose original and sound clustering algorithms for timed sequences. This approach is experimented with and evaluated on synthetic and real-world data.
翻译:改善医疗保健的未来始于更好地理解医院环境中当前的实际实践。这促使了从患者数据中发现典型医疗路径的目标。揭示典型医疗路径可以通过聚类实现。医疗路径聚类(以带时间戳的事件序列表示)的难点在于定义语义上合适的度量和聚类算法。在本文中,我们将两种为时间序列开发的方法适配到时序序列的聚类中:用于时序序列的drop-DTW度量方法和用于构建平均时间序列的DBA方法。这些方法随后被应用于聚类算法中,从而为时序序列提出了原创且可靠的聚类算法。该方法在合成数据和真实数据上进行了实验与评估。