Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation

Semantic segmentation models classify pixels into a set of known (``in-distribution'') visual classes. When deployed in an open world, the reliability of these models depends on their ability not only to classify in-distribution pixels but also to detect out-of-distribution (OoD) pixels. Historically, the poor OoD detection performance of these models has motivated the design of methods based on model re-training using synthetic training images that include OoD visual objects. Although successful, these re-trained methods have two issues: 1) their in-distribution segmentation accuracy may drop during re-training, and 2) their OoD detection accuracy does not generalise well to new contexts (e.g., country surroundings) outside the training set (e.g., city surroundings). In this paper, we mitigate these issues with: (i) a new residual pattern learning (RPL) module that assists the segmentation model to detect OoD pixels without affecting the inlier segmentation performance; and (ii) a novel context-robust contrastive learning (CoroCL) that enforces RPL to robustly detect OoD pixels among various contexts. Our approach improves by around 10\% FPR and 7\% AuPRC the previous state-of-the-art in Fishyscapes, Segment-Me-If-You-Can, and RoadAnomaly datasets. Our code is available at: https://github.com/yyliu01/RPL.

翻译：语义分割模型将像素分类到一组已知的（“分布内”）视觉类别中。当在开放世界中部署时，这些模型的可靠性不仅取决于其对分布内像素进行分类的能力，还取决于其检测分布外（OoD）像素的能力。历史上，这些模型较差的OoD检测性能促使人们设计基于模型重新训练的方法，这些方法使用包含OoD视觉对象的合成训练图像。尽管这些重新训练的方法取得了成功，但它们存在两个问题：1）在重新训练过程中，其分布内分割精度可能会下降；2）其OoD检测精度不能很好地泛化到训练集（如城市环境）之外的新场景（如乡村环境）。在本文中，我们通过以下方法缓解了这些问题：（i）一种新的残差模式学习（RPL）模块，该模块辅助分割模型检测OoD像素，同时不影响内点分割性能；（ii）一种新颖的上下文鲁棒对比学习（CoroCL），它强制RPL在各种上下文中稳健地检测OoD像素。我们的方法在Fishyscapes、Segment-Me-If-You-Can和RoadAnomaly数据集上将之前的先进水平提高了约10%的假阳性率（FPR）和7%的精确率-召回率曲线下面积（AuPRC）。我们的代码可在 https://github.com/yyliu01/RPL 获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日