A SAM-guided Two-stream Lightweight Model for Anomaly Detection

In industrial anomaly detection, model efficiency and mobile-friendliness are primary concerns in real-world applications. At the same time, the strong generalization capabilities of Segment Anything (SAM) have attracted broad academic attention, making it a natural choice for localizing unseen anomalies and diverse real-world patterns. Considering these two factors, we propose a SAM-guided Two-stream Lightweight Model for unsupervised anomaly detection (STLM), which meets both practical requirements while harnessing SAM's robust generalization. We employ two lightweight image encoders, which form our two-stream lightweight module, guided by SAM's knowledge. Specifically, one stream is trained to generate discriminative and general feature representations in both normal and anomalous regions, while the other stream reconstructs the same images without anomalies, so that the two streams' representations differ more strongly in anomalous regions. Furthermore, we employ a shared mask decoder and a feature aggregation module to generate anomaly maps. Experiments on the MVTec AD benchmark show that STLM, with about 16M parameters and an inference time of 20 ms, is competitive with state-of-the-art methods, reaching 98.26% pixel-level AUC and 94.92% PRO. We further experiment on more difficult datasets, e.g., VisA and DAGM, to demonstrate the effectiveness and generalizability of STLM.
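The two-stream idea can be illustrated with a minimal sketch: where the two streams' features agree the location looks normal, and where they disagree it looks anomalous. This is not the paper's method. STLM produces anomaly maps with a shared mask decoder and a feature aggregation module, whereas the sketch below swaps in a plain per-location cosine distance. The function name `cosine_anomaly_map`, the variable names `general_feat` and `recon_feat`, and the toy feature values are all assumptions made for illustration.

```python
import math


def cosine_anomaly_map(feat_a, feat_b, eps=1e-8):
    """Per-location cosine distance between two feature maps.

    feat_a, feat_b: nested lists shaped [channels][height][width].
    Returns a [height][width] map; values near 0 mean the streams agree,
    values near 2 mean they point in opposite directions.
    NOTE: illustrative stand-in, not the paper's decoder/aggregation module.
    """
    channels = len(feat_a)
    height = len(feat_a[0])
    width = len(feat_a[0][0])
    amap = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            va = [feat_a[c][y][x] for c in range(channels)]
            vb = [feat_b[c][y][x] for c in range(channels)]
            dot = sum(p * q for p, q in zip(va, vb))
            norm_a = math.sqrt(sum(p * p for p in va))
            norm_b = math.sqrt(sum(q * q for q in vb))
            amap[y][x] = 1.0 - dot / (norm_a * norm_b + eps)
    return amap


# Toy features (hypothetical values): 2 channels on a 2x2 grid.
general_feat = [[[1.0, 2.0], [3.0, 4.0]],
                [[0.5, 1.5], [2.5, 3.5]]]
# The reconstruction stream matches everywhere except one location,
# where we flip the feature vector to simulate an anomalous region.
recon_feat = [[list(row) for row in channel] for channel in general_feat]
for c in range(len(recon_feat)):
    recon_feat[c][1][0] = -general_feat[c][1][0]

amap = cosine_anomaly_map(general_feat, recon_feat)
```

In this toy case the map is close to zero everywhere except at the flipped location, which is the behavior the abstract describes qualitatively: the reconstruction stream suppresses the anomaly, so the two representations diverge only there.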

