Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified

From arXiv. Accepted for publication at GCPR 2024, after peer review. Use of this Accepted Version is subject to the publisher's Accepted Manuscript terms of use: https://www.springer-nature.com/gp/open-research/policies/accepted-manuscript-terms. Code is available on GitHub: https://github.com/milestonesys-research/VAD-with-PGMs/

In this study, we formulate the task of Video Anomaly Detection (VAD) as a probabilistic analysis of object bounding boxes. We hypothesize that representing objects solely by their bounding boxes can be sufficient to identify anomalous events in a scene. The implied benefits of this approach are increased object anonymization, faster model training, and lower computational requirements, which can particularly benefit video surveillance applications running on edge devices such as cameras. We design our model around human reasoning, which lends itself to explaining model output in human-understandable terms. The slowest model trains in less than 7 seconds on an 11th Generation Intel Core i9 processor. Although our approach drastically reduces the problem's feature space compared with prior art, we show that this does not reduce performance: the results we report are highly competitive on the benchmark datasets CUHK Avenue and ShanghaiTech, and significantly exceed the latest state-of-the-art results on StreetScene, which has so far proven to be the most challenging VAD dataset.
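To make the idea of scoring bounding boxes probabilistically concrete, here is a minimal sketch. It is not the authors' model (the linked repository holds the actual implementation); it is a deliberately simplified stand-in that assumes a single discrete distribution over box position and size, estimated by counting with add-one smoothing. The grid resolution, the size binning, and the function names `discretize`, `fit`, and `anomaly_score` are all hypothetical choices made for illustration.

```python
import math
from collections import Counter

GRID = 4       # hypothetical: split the frame into GRID x GRID cells
SIZE_BINS = 3  # hypothetical: coarse size classes based on box area


def discretize(box, frame_w, frame_h):
    """Map a box (x1, y1, x2, y2) in pixels to a discrete (col, row, size_bin) state."""
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2, (y1 + y2) / 2
    col = min(int(cx / frame_w * GRID), GRID - 1)
    row = min(int(cy / frame_h * GRID), GRID - 1)
    # Square root of the area fraction gives a scale-like value in [0, 1].
    area_frac = (x2 - x1) * (y2 - y1) / (frame_w * frame_h)
    size_bin = min(int(math.sqrt(area_frac) * SIZE_BINS), SIZE_BINS - 1)
    return col, row, size_bin


def fit(boxes, frame_w, frame_h):
    """Count how often each discrete state occurs in anomaly-free training boxes."""
    counts = Counter(discretize(b, frame_w, frame_h) for b in boxes)
    return counts, len(boxes)


def anomaly_score(box, model, frame_w, frame_h):
    """Negative log-probability of the box's state, with add-one smoothing."""
    counts, total = model
    n_states = GRID * GRID * SIZE_BINS
    p = (counts[discretize(box, frame_w, frame_h)] + 1) / (total + n_states)
    return -math.log(p)


if __name__ == "__main__":
    # Toy data (made up): every training box sits in the same region of the frame.
    train = [(10, 300, 50, 380)] * 50
    model = fit(train, 640, 480)
    print(anomaly_score((10, 300, 50, 380), model, 640, 480))    # familiar state
    print(anomaly_score((500, 20, 620, 200), model, 640, 480))   # unseen state
```

Because the model is only a table of counts over box states, fitting is a single pass over the training boxes, and a high score can be traced back to a specific, human-readable state (for example, "a large box in a cell where none was seen during training"). No pixel content is stored or inspected.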

