MA-VAE: Multi-head Attention-based Variational Autoencoder Approach for Anomaly Detection in Multivariate Time-series Applied to Automotive Endurance Powertrain Testing

September 5, 2023

Lucas Correia, Jan-Christoph Goos, Philipp Klein, Thomas Bäck, Anna V. Kononova

From arXiv; accepted at NCTA 2023

A clear need for automatic anomaly detection in automotive testing has emerged as more attention is paid to recorded data and manual evaluation by humans reaches its capacity. Such real-world data is massive, diverse, multivariate and temporal in nature, and therefore requires modelling of the testee behaviour. We propose a variational autoencoder with multi-head attention (MA-VAE) which, when trained on unlabelled data, not only produces very few false positives but also detects the majority of the anomalies presented. In addition, the approach offers a novel way to avoid the bypass phenomenon, an undesirable behaviour investigated in the literature. Lastly, the approach introduces a new method to remap individual windows to a continuous time series. The results are presented in the context of a real-world industrial data set, and several experiments are undertaken to further investigate certain aspects of the proposed model. When configured properly, it is wrong only 9% of the time when an anomaly is flagged and discovers 67% of the anomalies present. MA-VAE also has the potential to perform well with only a fraction of the training and validation subset; however, extracting that potential requires a more sophisticated threshold estimation method.
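The two headline figures can be read as standard detection metrics: being wrong 9% of the time when an anomaly is flagged corresponds to a precision of about 0.91, and discovering 67% of the anomalies corresponds to a recall of 0.67. That reading is an interpretation of the abstract, not a statement from the authors. The sketch below shows how these quantities are typically computed from binary flags and labels, and derives the F1 score the two figures would imply. The function names and the toy data are hypothetical and chosen only for illustration.

```python
def precision_recall(flags, labels):
    """Return (precision, recall) for binary anomaly flags against binary labels."""
    pairs = list(zip(flags, labels))
    tp = sum(1 for f, y in pairs if f and y)        # flagged and truly anomalous
    fp = sum(1 for f, y in pairs if f and not y)    # flagged but normal
    fn = sum(1 for f, y in pairs if not f and y)    # anomalous but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Toy data (hypothetical): 1 = anomaly, 0 = normal.
toy_flags = [1, 1, 0, 0, 1, 0]
toy_labels = [1, 0, 0, 1, 1, 0]
toy_precision, toy_recall = precision_recall(toy_flags, toy_labels)

# Figures taken from the abstract, read as precision and recall (assumption).
reported_precision = 1.0 - 0.09
reported_recall = 0.67
implied_f1 = f1_score(reported_precision, reported_recall)

print(f"toy precision={toy_precision:.2f}, recall={toy_recall:.2f}")
print(f"implied F1 from the abstract's figures: {implied_f1:.3f}")
```

The implied F1 is derived arithmetic only; the abstract itself reports no F1 value, and the unit of evaluation (window, time step or whole sequence) is not specified there.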

Related content

Autoencoder

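An autoencoder is an artificial neural network used to learn efficient data encodings in an unsupervised manner. Its aim is to learn a representation (encoding) for a set of data, typically for dimensionality reduction, by training the network to ignore signal "noise". Along with the reduction side, a reconstruction side is learned, in which the autoencoder tries to generate from the reduced encoding a representation as close as possible to its original input, hence its name. Several variants of the basic model exist, with the aim of forcing the learned representations of the input to assume useful properties. Autoencoders are used effectively to solve many applied problems, from face recognition to acquiring the semantic meaning of words.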
