全自动睡眠分期：针对帕金森病及孤立性快速眼动睡眠行为障碍的可泛化深度神经网络的多中心验证 (Fully-automated sleep staging: multicenter validation of a generalizable deep neural network for Parkinson's disease and isolated REM sleep behavior disorder)

Fully-automated sleep staging: multicenter validation of a generalizable deep neural network for Parkinson's disease and isolated REM sleep behavior disorder

翻译：全自动睡眠分期：针对帕金森病及孤立性快速眼动睡眠行为障碍的可泛化深度神经网络的多中心验证

Jesper Strøm,Casper Skjærbæk,Natasha Becker Bertelsen,Steffen Torpe Simonsen,Niels Okkels,David Bertram,Sinah Röttgen,Konstantin Kufer,Kaare B. Mikkelsen,Marit Otto,Poul Jørgen Jennum,Per Borghammer,Michael Sommerauer,Preben Kidmose

from arxiv, 21 pages excluding supplementary, 9 figures

Isolated REM sleep behavior disorder (iRBD) is a key prodromal marker of Parkinson's disease (PD), and video-polysomnography (vPSG) remains the diagnostic gold standard. However, manual sleep staging is particularly challenging in neurodegenerative diseases due to EEG abnormalities and fragmented sleep, making PSG assessments a bottleneck for deploying new RBD screening technologies at scale. We adapted U-Sleep, a deep neural network, for generalizable sleep staging in PD and iRBD. A pretrained U-Sleep model, based on a large publicly available, multisite non-neurodegenerative dataset (PUB; 19,236 PSGs across 12 sites), was fine-tuned on research datasets from two centers (Lundbeck Foundation Parkinson's Disease Research Center (PACE) and the Cologne-Bonn Cohort (CBC); 112 PD, 138 iRBD, 89 age-matched controls. The resulting model was evaluated on an independent dataset from the Danish Center for Sleep Medicine (DCSM; 81 PD, 36 iRBD, 87 sleep-clinic controls). A subset of PSGs with low agreement between the human rater and the model (\k{appa} < 0.6) was re-scored by a second blinded human rater to identify sources of disagreement. Finally, we applied confidence-based thresholds to optimize REM sleep staging. The pretrained model achieved mean \k{appa} = 0.81 in PUB, but \k{appa} = 0.66 when applied directly to PACE/CBC. By fine-tuning the model, we developed a generalized model with \k{appa} = 0.74 on PACE/CBC (p < 0.001 vs. the pretrained model). In DCSM, mean and median \k{appa} increased from 0.60 to 0.64 (p < 0.001) and 0.64 to 0.69 (p < 0.001), respectively. In the interrater study, PSGs with low agreement between the model and the initial scorer showed similarly low agreement between human scorers. Applying a confidence threshold increased the proportion of correctly identified REM sleep epochs from 85% to 95.5%, while preserving sufficient (> 5 min) REM sleep for 95% of subjects.

翻译：孤立性快速眼动睡眠行为障碍（iRBD）是帕金森病（PD）的关键前驱标志物，而视频多导睡眠监测（vPSG）仍是诊断的金标准。然而，在神经退行性疾病中，由于脑电图异常和睡眠片段化，人工睡眠分期尤为困难，这使得PSG评估成为大规模部署新型RBD筛查技术的瓶颈。我们针对PD和iRBD，对深度神经网络U-Sleep进行了适应性调整，以实现可泛化的睡眠分期。一个基于大型公开、多中心非神经退行性疾病数据集（PUB；来自12个中心的19,236份PSG记录）预训练的U-Sleep模型，在两个研究中心（灵北基金会帕金森病研究中心（PACE）和科隆-波恩队列（CBC）；包含112例PD、138例iRBD、89例年龄匹配对照）的研究数据集上进行了微调。所得模型在一个来自丹麦睡眠医学中心（DCSM；包含81例PD、36例iRBD、87例睡眠门诊对照）的独立数据集上进行了评估。对于人工评分者与模型之间一致性较低（κ < 0.6）的PSG记录子集，由第二位盲法人工评分者重新评分，以识别不一致的来源。最后，我们应用基于置信度的阈值来优化快速眼动睡眠分期。预训练模型在PUB数据集中平均κ = 0.81，但直接应用于PACE/CBC时κ = 0.66。通过对模型进行微调，我们开发了一个泛化模型，在PACE/CBC数据集上κ = 0.74（与预训练模型相比，p < 0.001）。在DCSM数据集中，平均κ和中位数κ分别从0.60提升至0.64（p < 0.001）和从0.64提升至0.69（p < 0.001）。在评分者间研究中，模型与初始评分者一致性较低的PSG记录，在人类评分者之间也表现出类似较低的一致性。应用置信度阈值后，正确识别的快速眼动睡眠时段比例从85%提高至95.5%，同时为95%的受试者保留了足够（> 5分钟）的快速眼动睡眠。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

科研动态| 不依赖GPT-4的多模态幻觉评估benchmark来了！一键实现多维度幻觉自动分析

专知会员服务

26+阅读 · 2023年11月15日