This work proposes and investigates a novel method for anomaly detection and shows it to be competitive in a variety of Euclidean and non-Euclidean situations. It is based on an extension of the depth quantile functions (DQF) approach. The DQF approach encodes geometric information about a point cloud via functions of a single variable, whereas each observation in a data set is associated with a single such function. Plotting these functions provides a very beneficial visualization aspect. This technique can be applied to any data lying in a Hilbert space. The proposed anomaly detection approach is motivated by the geometric insight of the presence of anomalies in data being tied to the existence of antimodes in the data generating distribution. Coupling this insight with novel theoretical understanding into the shape of the DQFs gives rise to the proposed adaptive DQF (aDQF) methodology. Applications to various data sets illustrate the DQF and aDQF's strong anomaly detection performance, and the benefits of its visualization aspects.
翻译:本文提出并研究了一种新颖的异常检测方法,并证明其在多种欧几里得和非欧几里得场景下具有竞争力。该方法基于深度分位函数(DQF)方法的扩展。DQF方法通过单变量函数编码点云中的几何信息,而数据集中的每个观测值都对应一个这样的函数。绘制这些函数提供了极有利的可视化效果。该技术可应用于任何位于希尔伯特空间中的数据。所提出的异常检测方法源于对数据中异常存在与数据生成分布中反模态存在相关联的几何洞见。将该洞见与对DQF形状的新理论理解相结合,催生了所提出的自适应DQF(aDQF)方法。对各类数据集的应用展示了DQF和aDQF强大的异常检测性能及其可视化优势。