Remote photoplethysmography (rPPG) is a noninvasive technique that aims to capture subtle variations in facial pixels caused by changes in blood volume resulting from cardiac activities. Most existing unsupervised methods for rPPG tasks focus on the contrastive learning between samples while neglecting the inherent self-similar prior in physiological signals. In this paper, we propose a Self-Similarity Prior Distillation (SSPD) framework for unsupervised rPPG estimation, which capitalizes on the intrinsic self-similarity of cardiac activities. Specifically, we first introduce a physical-prior embedded augmentation technique to mitigate the effect of various types of noise. Then, we tailor a self-similarity-aware network to extract more reliable self-similar physiological features. Finally, we develop a hierarchical self-distillation paradigm to assist the network in disentangling self-similar physiological patterns from facial videos. Comprehensive experiments demonstrate that the unsupervised SSPD framework achieves comparable or even superior performance compared to the state-of-the-art supervised methods. Meanwhile, SSPD maintains the lowest inference time and computation cost among end-to-end models. The source codes are available at https://github.com/LinXi1C/SSPD.
翻译:远程光电容积描记术(rPPG)是一种非侵入性技术,旨在捕捉由心脏活动引起的血容量变化导致的面部像素细微变化。现有大多数无监督rPPG方法侧重于样本间的对比学习,却忽视了生理信号中固有的自相似先验。本文提出一种用于无监督rPPG估计的自相似先验蒸馏(SSPD)框架,该框架充分利用心脏活动的内在自相似性。具体而言,我们首先引入一种物理先验嵌入增强技术,以减轻各类噪声的影响;随后定制设计自相似感知网络,提取更可靠的自相似生理特征;最后,我们开发了一种层次化自蒸馏范式,协助网络从面部视频中解耦自相似生理模式。综合实验表明,无监督SSPD框架达到了与最先进有监督方法相当甚至更优的性能。同时,SSPD在端到端模型中保持了最低的推理时间和计算成本。源代码已开源至https://github.com/LinXi1C/SSPD。