Semantic segmentation models classify pixels into a set of known (``in-distribution'') visual classes. When deployed in an open world, the reliability of these models depends on their ability not only to classify in-distribution pixels but also to detect out-of-distribution (OoD) pixels. Historically, the poor OoD detection performance of these models has motivated the design of methods based on model re-training using synthetic training images that include OoD visual objects. Although successful, these re-trained methods have two issues: 1) their in-distribution segmentation accuracy may drop during re-training, and 2) their OoD detection accuracy does not generalise well to new contexts (e.g., country surroundings) outside the training set (e.g., city surroundings). In this paper, we mitigate these issues with: (i) a new residual pattern learning (RPL) module that assists the segmentation model to detect OoD pixels without affecting the inlier segmentation performance; and (ii) a novel context-robust contrastive learning (CoroCL) that enforces RPL to robustly detect OoD pixels among various contexts. Our approach improves by around 10\% FPR and 7\% AuPRC the previous state-of-the-art in Fishyscapes, Segment-Me-If-You-Can, and RoadAnomaly datasets. Our code is available at: https://github.com/yyliu01/RPL.
翻译:语义分割模型将像素分类为一组已知的(“分布内”)视觉类别。当在开放世界中部署时,这些模型的可靠性不仅取决于它们对分布内像素的分类能力,还取决于它们检测分布外(OoD)像素的能力。历史上,这些模型较差的OoD检测性能促使了基于使用包含OoD视觉对象的合成训练图像进行模型重训练的方法的设计。尽管这些重训练方法取得了成功,但存在两个问题:1)在重训练过程中,其分布内分割精度可能下降;2)其OoD检测精度对训练集(如城市环境)之外的新环境(如乡村环境)泛化能力较差。在本文中,我们通过以下方法缓解了这些问题:(i)一种新的残差模式学习(RPL)模块,该模块辅助分割模型检测OoD像素而不影响内点分割性能;(ii)一种新颖的上下文鲁棒对比学习(CoroCL),它强制RPL在各种上下文中稳健地检测OoD像素。我们的方法在Fishyscapes、Segment-Me-If-You-Can和RoadAnomaly数据集上将先前最先进方法的FPR降低了约10%,AuPRC提高了约7%。我们的代码可在https://github.com/yyliu01/RPL获取。