Predicting Extubation Failure in Intensive Care: The Development of a Novel, End-to-End Actionable and Interpretable Prediction System

from arxiv, Thesis submitted in fulfilment of requirements for the degree of Master of Science in Computing - Department of Computing, Imperial College London

Predicting extubation failure in intensive care is challenging due to complex data and the severe consequences of inaccurate predictions. Machine learning shows promise in improving clinical decision-making but often fails to account for temporal patient trajectories and model interpretability, highlighting the need for innovative solutions. This study aimed to develop an actionable, interpretable prediction system for extubation failure using temporal modelling approaches such as Long Short-Term Memory (LSTM) and Temporal Convolutional Networks (TCN). A retrospective cohort study of 4,701 mechanically ventilated patients from the MIMIC-IV database was conducted. Data from the 6 hours before extubation, including static and dynamic features, were processed through novel techniques addressing data inconsistency and synthetic data challenges. Feature selection was guided by clinical relevance and literature benchmarks. Iterative experimentation involved training LSTM, TCN, and LightGBM models. Initial results showed a strong bias toward predicting extubation success, despite advanced hyperparameter tuning and static data inclusion. Data was stratified by sampling frequency to reduce synthetic data impacts, leading to a fused decision system with improved performance. However, all architectures yielded modest predictive power (AUC-ROC ~0.6; F1 <0.5) with no clear advantage in incorporating static data or additional features. Ablation analysis indicated minimal impact of individual features on model performance. This thesis highlights the challenges of synthetic data in extubation failure prediction and introduces strategies to mitigate bias, including clinician-informed preprocessing and novel feature subsetting. While performance was limited, the study provides a foundation for future work, emphasising the need for reliable, interpretable models to optimise ICU outcomes.

翻译：预测重症监护中的拔管失败具有挑战性，原因在于数据复杂且预测不准确的后果严重。机器学习在改善临床决策方面显示出潜力，但往往未能考虑患者的时间轨迹和模型的可解释性，这凸显了对创新解决方案的需求。本研究旨在利用长短期记忆网络（LSTM）和时序卷积网络（TCN）等时序建模方法，开发一种可操作、可解释的拔管失败预测系统。研究对来自MIMIC-IV数据库的4,701名机械通气患者进行了回顾性队列研究。拔管前6小时的数据，包括静态和动态特征，通过处理数据不一致性和合成数据挑战的新技术进行处理。特征选择以临床相关性和文献基准为指导。迭代实验涉及训练LSTM、TCN和LightGBM模型。初步结果显示，尽管进行了高级超参数调优并纳入了静态数据，模型仍存在强烈偏向预测拔管成功的偏差。通过按采样频率对数据进行分层以减少合成数据的影响，最终形成了一个性能改进的融合决策系统。然而，所有架构的预测能力均有限（AUC-ROC ~0.6；F1 <0.5），且在纳入静态数据或额外特征方面无明显优势。消融分析表明，单个特征对模型性能的影响微乎其微。本论文强调了合成数据在拔管失败预测中的挑战，并介绍了减轻偏差的策略，包括基于临床知识的预处理和新的特征子集划分方法。尽管性能有限，但本研究为未来工作奠定了基础，强调了需要可靠、可解释的模型以优化重症监护室（ICU）的治疗结果。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日