In this study, we address the interpretability issue in complex, black-box Machine Learning models applied to sequence data. We introduce the Model-Based tree Hidden Semi-Markov Model (MOB-HSMM), an inherently interpretable model aimed at detecting high mortality risk events and discovering hidden patterns associated with the mortality risk in Intensive Care Units (ICU). This model leverages knowledge distilled from Deep Neural Networks (DNN) to enhance predictive performance while offering clear explanations. Our experimental results indicate the improved performance of Model-Based trees (MOB trees) via employing LSTM for learning sequential patterns, which are then transferred to MOB trees. Integrating MOB trees with the Hidden Semi-Markov Model (HSMM) in the MOB-HSMM enables uncovering potential and explainable sequences using available information.
翻译:本研究旨在解决应用于序列数据的复杂黑箱机器学习模型的可解释性问题。我们提出了一种具有内在可解释性的基于模型的树状隐半马尔可夫模型(MOB-HSMM),用于检测重症监护病房(ICU)中的高死亡风险事件,并发现与死亡风险相关的隐藏模式。该模型通过利用从深度神经网络(DNN)中提炼的知识,在提升预测性能的同时提供清晰的解释。实验结果表明,通过采用LSTM学习序列模式并将其迁移至基于模型的树(MOB树),MOB树的性能得到提升。将MOB树与隐半马尔可夫模型(HSMM)整合至MOB-HSMM中,能够利用现有信息揭示潜在且可解释的序列。