Open-set supervised anomaly detection (OSAD) - a recently emerging anomaly detection area - aims at utilizing a few samples of anomaly classes seen during training to detect unseen anomalies (i.e., samples from open-set anomaly classes), while effectively identifying the seen anomalies. Benefiting from the prior knowledge illustrated by the seen anomalies, current OSAD methods can often largely reduce false positive errors. However, these methods treat the anomaly examples as from a homogeneous distribution, rendering them less effective in generalizing to unseen anomalies that can be drawn from any distribution. In this paper, we propose to learn heterogeneous anomaly distributions using the limited anomaly examples to address this issue. To this end, we introduce a novel approach, namely Anomaly Heterogeneity Learning (AHL), that simulates a diverse set of heterogeneous (seen and unseen) anomaly distributions and then utilizes them to learn a unified heterogeneous abnormality model. Further, AHL is a generic framework that existing OSAD models can plug and play for enhancing their abnormality modeling. Extensive experiments on nine real-world anomaly detection datasets show that AHL can 1) substantially enhance different state-of-the-art (SOTA) OSAD models in detecting both seen and unseen anomalies, achieving new SOTA performance on a large set of datasets, and 2) effectively generalize to unseen anomalies in new target domains.
翻译:开放集监督异常检测(OSAD)——一个近期兴起的研究领域——旨在利用训练过程中所见异常类别的少量样本来检测未见异常(即来自开放集异常类别的样本),同时有效识别所见异常。得益于所见异常提供的先验知识,现有OSAD方法通常能大幅降低误报率。然而,这些方法将异常样本视为来自同质分布,使其在泛化至可能来自任意分布的未见异常时效率降低。本文提出利用有限异常样本学习异质性异常分布以解决该问题。为此,我们引入了一种名为异常异质性学习(AHL)的新方法,该方法模拟一组多样化的异质性(所见与未见)异常分布,并利用这些分布学习统一的异质性异常模型。此外,AHL是一个通用框架,现有OSAD模型可即插即用,以增强其异常建模能力。在九个真实世界异常检测数据集上的大量实验表明,AHL能够:1)显著增强不同最先进(SOTA)OSAD模型对所见与未见异常的检测能力,在多个数据集上实现新的SOTA性能;2)有效泛化至新目标域中的未见异常。