QualityFM: a Multimodal Physiological Signal Foundation Model with Self-Distillation for Signal Quality Challenges in Critically Ill Patients

Photoplethysmogram (PPG) and electrocardiogram (ECG) signals are commonly recorded in the intensive care unit (ICU) and the operating room (OR). However, the high incidence of poor, incomplete, and inconsistent signal quality can lead to false alarms or diagnostic inaccuracies. Existing methods suffer from limited generalizability, reliance on extensive labeled data, and poor cross-task transferability. To overcome these challenges, we introduce QualityFM, a novel multimodal foundation model for these physiological signals, designed to acquire a general-purpose understanding of signal quality. Our model is pre-trained on a large-scale dataset comprising over 21 million 30-second waveforms (179,757 hours of data). Our approach uses a dual-track architecture that processes paired physiological signals of differing quality and applies a self-distillation strategy in which an encoder for high-quality signals guides the training of an encoder for low-quality signals. To handle long sequential signals efficiently and capture essential local quasi-periodic patterns, we integrate a windowed sparse attention mechanism into our Transformer-based model. Furthermore, a composite loss function, which combines a direct distillation loss on the encoder outputs with an indirect reconstruction loss based on power and phase spectra, preserves the frequency-domain characteristics of the signals. We pre-train three models with varying parameter counts (9.6 M to 319 M) and demonstrate their efficacy and practical value through transfer learning on three distinct clinical tasks: detection of false ventricular tachycardia alarms, identification of atrial fibrillation, and estimation of arterial blood pressure (ABP) from PPG and ECG signals.
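
The abstract names two mechanisms without giving formulas: windowed sparse attention, and a composite loss that adds a distillation term on encoder outputs to a reconstruction term on power and phase spectra. The NumPy sketch below illustrates one plausible reading of each and is not the authors' implementation. All function names, the mean-squared-error distillation term, the cosine-based phase term, the weights `alpha` and `beta`, the window size, and the 100 Hz sampling rate are assumptions made for illustration.

```python
import numpy as np


def window_attention_mask(seq_len: int, window: int) -> np.ndarray:
    """Boolean mask where token i may attend to token j only if |i - j| <= window.

    Hypothetical helper: a banded mask is one common way to realise
    windowed sparse attention; the paper's exact pattern is not given.
    """
    idx = np.arange(seq_len)
    return np.abs(idx[:, None] - idx[None, :]) <= window


def spectral_loss(recon: np.ndarray, target: np.ndarray) -> float:
    """Assumed spectral term: MSE on power spectra plus a phase mismatch term."""
    r = np.fft.rfft(recon, norm="ortho")
    t = np.fft.rfft(target, norm="ortho")
    power_term = np.mean((np.abs(r) ** 2 - np.abs(t) ** 2) ** 2)
    # 1 - cos(delta) is zero for matching phases and ignores 2*pi wrap-around.
    phase_term = np.mean(1.0 - np.cos(np.angle(r) - np.angle(t)))
    return float(power_term + phase_term)


def composite_loss(student_emb, teacher_emb, recon, target,
                   alpha: float = 1.0, beta: float = 1.0) -> float:
    """Assumed form: alpha * distillation MSE + beta * spectral reconstruction.

    In a real training loop the teacher embedding (high-quality track)
    would be treated as a constant target; NumPy has no gradients, so
    that detail is only noted here.
    """
    distill = np.mean((np.asarray(student_emb) - np.asarray(teacher_emb)) ** 2)
    return float(alpha * distill + beta * spectral_loss(recon, target))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(3000) / 100.0            # 30 s window at an assumed 100 Hz
    clean = np.sin(2 * np.pi * 1.2 * t)    # toy quasi-periodic "pulse" signal
    noisy = clean + 0.5 * rng.standard_normal(t.size)

    teacher = rng.standard_normal(128)     # stand-in for high-quality encoder output
    student = teacher + 0.1 * rng.standard_normal(128)

    print("mask shape:", window_attention_mask(8, 2).shape)
    print("loss (noisy reconstruction):", composite_loss(student, teacher, noisy, clean))
    print("loss (perfect reconstruction):", composite_loss(student, teacher, clean, clean))
```

With a banded mask, each token attends to at most `2 * window + 1` neighbours, so attention cost grows linearly with sequence length rather than quadratically, which is the usual motivation for windowed attention on long waveforms.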

