Dynamic Erasing Network Based on Multi-Scale Temporal Features for Weakly Supervised Video Anomaly Detection

The goal of weakly supervised video anomaly detection is to learn a detection model using only video-level labels. However, prior studies typically divide videos into fixed-length segments without considering the complexity or duration of anomalies. Moreover, these studies usually detect only the most abnormal segments, potentially missing the full extent of an anomaly. To address these limitations, we propose a Dynamic Erasing Network (DE-Net) for weakly supervised video anomaly detection, which learns multi-scale temporal features. Specifically, to handle variations in the duration of abnormal events, we first propose a multi-scale temporal modeling module that extracts features from segments of varying lengths and captures both local and global visual information across different temporal scales. Then, we design a dynamic erasing strategy, which dynamically assesses the completeness of the detected anomalies and erases prominent abnormal segments to encourage the model to discover subtle abnormal segments in a video. The proposed method performs favorably against several state-of-the-art approaches on three datasets: XD-Violence, TAD, and UCF-Crime. Code will be made available at https://github.com/ArielZc/DE-Net.
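The two ideas in the abstract can be illustrated with a toy sketch. This is not the DE-Net implementation: the function names, the average-pooling choice, the scalar per-snippet features, and the `ratio` parameter are all assumptions made for illustration. It only shows what pooling over several segment lengths and erasing the highest-scoring segments might look like.

```python
from statistics import mean


def multi_scale_features(features, scales=(1, 2, 4)):
    """Average-pool per-snippet features over windows of several lengths.

    `features` is a list of scalar snippet features (a simplification;
    real features would be vectors). Returns a dict mapping each window
    length to the list of pooled values at that temporal scale.
    """
    pooled = {}
    for s in scales:
        pooled[s] = [mean(features[i:i + s]) for i in range(0, len(features), s)]
    return pooled


def erase_prominent(scores, ratio=0.9):
    """Erase the most prominent segments of one video.

    The threshold is set per video as `ratio * max(scores)` (a hypothetical
    rule, not the paper's), so it adapts to each video's score range.
    Segments at or above the threshold are zeroed, so that a later pass
    has to look at the remaining, less prominent segments.
    Returns (erased_scores, mask), where mask marks the erased segments.
    """
    threshold = ratio * max(scores)
    mask = [sc >= threshold for sc in scores]
    erased = [0.0 if m else sc for sc, m in zip(scores, mask)]
    return erased, mask


if __name__ == "__main__":
    print(multi_scale_features([0.0, 1.0, 2.0, 3.0], scales=(1, 2, 4)))
    print(erase_prominent([0.1, 0.9, 0.4, 0.85]))
```

In this sketch, erasing is a simple masking step; how the actual method assesses completeness and decides what to erase is described in the paper and repository, not here.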

