A Foundation Model for Patient Behavior Monitoring and Suicide Detection

Foundation models (FMs) have achieved remarkable success across various domains, yet their adoption in healthcare remains limited. While significant advances have been made in medical imaging, genetic biomarkers, and time series from electronic health records, the potential of FMs for patient behavior monitoring through wearable devices remains underexplored. These datasets are inherently heterogeneous, multisource, and often exhibit high rates of missing data, posing unique challenges. This paper introduces a novel FM based on a modified vector quantized variational autoencoder (VQ-VAE), specifically designed to process real-world data from wearable devices. We demonstrate that our pretrained FM, trained on a broad cohort of psychiatric patients, performs downstream tasks via its latent representation without fine-tuning on a held-out cohort of suicidal patients. To illustrate this, we develop a probabilistic change-point detection algorithm for suicide detection and demonstrate the FM's effectiveness in predicting emotional states. Our results show that the discrete latent structure of the VQ-VAE outperforms a state-of-the-art Informer architecture in unsupervised suicide detection, while matching its performance in supervised emotion prediction when the latent dimensionality is increased, though at the cost of reduced unsupervised accuracy. This trade-off highlights the need for future FMs to integrate hybrid discrete-continuous structures for balanced performance across tasks.

翻译：基础模型（FMs）已在多个领域取得显著成功，但其在医疗健康领域的应用仍较为有限。尽管在医学影像、遗传生物标志物以及电子健康记录的时间序列分析方面已取得重大进展，但利用可穿戴设备进行患者行为监测的基础模型潜力尚未得到充分探索。此类数据集具有固有的异质性、多源性，且通常存在较高的数据缺失率，带来了独特的挑战。本文提出了一种基于改进的向量量化变分自编码器（VQ-VAE）的新型基础模型，专门用于处理来自可穿戴设备的真实世界数据。我们证明，在广泛的精神病患者队列上预训练的基础模型，能够通过其潜在表征在未经过微调的情况下，对独立保留的自杀患者队列执行下游任务。为验证此点，我们开发了一种用于自杀检测的概率性变点检测算法，并展示了该模型在预测情绪状态方面的有效性。结果表明，在无监督自杀检测任务中，VQ-VAE的离散潜在结构优于当前最先进的Informer架构；而在增加潜在维度后，其有监督情绪预测性能与Informer相当，但会以无监督检测精度下降为代价。这种权衡凸显了未来基础模型需要整合离散-连续混合结构，以实现跨任务的平衡性能。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日