Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors

We propose an efficient abnormal event detection model based on a lightweight masked auto-encoder (AE) applied at the video frame level. The novelty of the proposed model is threefold. First, we introduce an approach to weight tokens based on motion gradients, thus avoiding learning to reconstruct the static background scene. Second, we integrate a teacher decoder and a student decoder into our architecture, leveraging the discrepancy between the outputs given by the two decoders to improve anomaly detection. Third, we generate synthetic abnormal events to augment the training videos, and task the masked AE model to jointly reconstruct the original frames (without anomalies) and the corresponding pixel-level anomaly maps. Our design leads to an efficient and effective model, as demonstrated by the extensive experiments carried out on three benchmarks: Avenue, ShanghaiTech and UCSD Ped2. The empirical results show that our model achieves an excellent trade-off between speed and accuracy, obtaining competitive AUC scores, while processing 1670 FPS. Hence, our model is between 8 and 70 times faster than competing methods. We also conduct an ablation study to justify our design.

翻译：我们提出了一种基于轻量级掩码自编码器（AE）的高效异常事件检测模型，该模型在视频帧级别上运行。本模型的创新性体现在三个方面。首先，我们引入了一种基于运动梯度对令牌进行加权的方法，从而避免学习重建静态背景场景。其次，我们集成了教师解码器与学生解码器，利用两个解码器输出之间的差异来提升异常检测性能。第三，我们通过生成合成异常事件来扩充训练视频，并让掩码AE模型联合重建原始帧（不含异常）及对应的像素级异常图。在Avenue、ShanghaiTech和UCSD Ped2三个基准数据集上的大量实验表明，我们的设计方案实现了高效且有效的模型。实验结果显示，该模型在速度与准确性之间取得了优异平衡，在获得具有竞争力的AUC分数的同时，处理速度达到1670 FPS。因此，我们的模型比现有竞争方法快8到70倍。此外，我们还通过消融研究验证了设计方案的合理性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日