PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection

Object anomaly detection is an important problem in machine vision and has seen remarkable progress recently. However, two significant challenges hinder its research and application. First, existing datasets lack comprehensive visual information from a range of pose angles. They usually rest on the unrealistic assumption that the anomaly-free training dataset is pose-aligned and that the test samples have the same pose as the training data. In practice, however, anomalies may occur in any region of an object, and the training and query samples may have different poses, which calls for the study of pose-agnostic anomaly detection. Second, the absence of a consensus on experimental protocols for pose-agnostic anomaly detection leads to unfair comparisons between methods, which further hinders research on the problem. To address these issues, we develop the Multi-pose Anomaly Detection (MAD) dataset and the Pose-agnostic Anomaly Detection (PAD) benchmark, which take a first step toward solving the pose-agnostic anomaly detection problem. Specifically, we build MAD using 20 complex-shaped LEGO toys, including 4K views with various poses and high-quality, diverse 3D anomalies in both simulated and real environments. Additionally, we propose a novel method, OmniposeAD, trained using MAD and specifically designed for pose-agnostic anomaly detection. Through comprehensive evaluations, we demonstrate the relevance of our dataset and method. Furthermore, we provide an open-source benchmark library, including the dataset and baseline methods that cover 8 anomaly detection paradigms, to facilitate future research and application in this domain. Code, data, and models are publicly available at https://github.com/EricLee0224/PAD.

Related content

Anomaly detection

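In data mining, anomaly detection is the identification of items, events, or observations that do not conform to an expected pattern or to the other items in a dataset. Anomalous items typically translate into problems such as bank fraud, structural defects, medical problems, or errors in text. Anomalies are also referred to as outliers, novelties, noise, deviations, and exceptions. In particular, when detecting abuse and network intrusion, the interesting objects are often not rare objects but unexpected bursts of activity. This pattern does not follow the usual statistical definition of an outlier as a rare object, so many anomaly detection methods (unsupervised ones in particular) will fail on such data unless it has been aggregated appropriately. A cluster analysis algorithm, by contrast, may be able to detect the micro-clusters formed by these patterns. There are three broad categories of anomaly detection methods. [1] Unsupervised anomaly detection methods assume that the majority of instances in the dataset are normal and detect anomalies in unlabeled test data by looking for the instances that fit least well with the rest of the data. Supervised anomaly detection methods require a dataset that has been labeled "normal" and "abnormal" and involve training a classifier (the key difference from many other statistical classification problems is the inherently unbalanced nature of anomaly detection). Semi-supervised anomaly detection methods build a model of normal behavior from a given normal training dataset and then test the likelihood that a test instance was generated by the learned model.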
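The semi-supervised approach described above can be illustrated with a minimal sketch. It assumes one-dimensional measurements and a Gaussian model of normal behavior. The function names, the sample values, and the three-standard-deviation threshold are all hypothetical choices made for illustration; none of them come from the PAD benchmark or from OmniposeAD.

```python
import math
import statistics


def fit_normal_model(train_values):
    """Fit a 1-D Gaussian to anomaly-free training values (assumed model)."""
    mean = statistics.fmean(train_values)
    std = statistics.stdev(train_values)  # sample standard deviation
    return mean, std


def log_likelihood(x, mean, std):
    """Log-density of x under the fitted Gaussian."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))


def detect_anomalies(test_values, mean, std, threshold):
    """Flag test values whose log-likelihood falls below the threshold."""
    return [log_likelihood(x, mean, std) < threshold for x in test_values]


# Hypothetical anomaly-free training measurements.
normal_train = [9.8, 10.1, 10.0, 9.9, 10.2, 10.0, 9.7, 10.3]
mean, std = fit_normal_model(normal_train)

# Illustrative threshold: the log-likelihood of a point three standard
# deviations away from the mean. Any less likely point is flagged.
threshold = log_likelihood(mean + 3.0 * std, mean, std)

flags = detect_anomalies([10.05, 9.9, 14.0], mean, std, threshold)
print(flags)
```

Only normal data is used to fit the model, and each test instance is judged solely by how likely it is under that model, which is what separates this setting from the supervised one. Image-based methods replace the Gaussian with a much richer model of normal appearance, but the fit-then-score structure is the same.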
