Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

Weakly-Supervised Concealed Object Segmentation (WSCOS) aims to segment objects well blended with surrounding environments using sparsely-annotated data for model training. It remains a challenging task since (1) it is hard to distinguish concealed objects from the background due to the intrinsic similarity and (2) the sparsely-annotated training data only provide weak supervision for model learning. In this paper, we propose a new WSCOS method to address these two challenges. To tackle the intrinsic similarity challenge, we design a multi-scale feature grouping module that first groups features at different granularities and then aggregates these grouping results. By grouping similar features together, it encourages segmentation coherence, helping obtain complete segmentation results for both single and multiple-object images. For the weak supervision challenge, we utilize the recently-proposed vision foundation model, Segment Anything Model (SAM), and use the provided sparse annotations as prompts to generate segmentation masks, which are used to train the model. To alleviate the impact of low-quality segmentation masks, we further propose a series of strategies, including multi-augmentation result ensemble, entropy-based pixel-level weighting, and entropy-based image-level selection. These strategies help provide more reliable supervision to train the segmentation model. We verify the effectiveness of our method on various WSCOS tasks, and experiments demonstrate that our method achieves state-of-the-art performance on these tasks.

翻译：弱监督隐蔽目标分割（Weakly-Supervised Concealed Object Segmentation, WSCOS）旨在利用稀疏标注数据训练模型，以分割与周围环境高度融合的目标。该任务具有较大挑战性，原因在于：(1) 由于内在相似性，隐蔽目标与背景难以区分；(2) 稀疏标注的训练数据只能为模型学习提供弱监督信号。本文针对上述两个挑战提出一种新的WSCOS方法。为解决内在相似性挑战，我们设计了多尺度特征分组模块，该模块首先对不同粒度的特征进行分组，随后聚合这些分组结果。通过将相似特征聚合在一起，该方法可增强分割连贯性，有助于在单目标和多目标图像中获取完整的分割结果。针对弱监督挑战，我们利用近期提出的视觉基础模型——Segment Anything Model（SAM），并将提供的稀疏标注作为提示生成分割掩码，用于训练模型。为缓解低质量分割掩码的影响，我们进一步提出一系列策略，包括多增强结果集成、基于熵的像素级加权以及基于熵的图像级筛选。这些策略可为训练分割模型提供更可靠的监督信号。我们在多种WSCOS任务上验证了方法的有效性，实验结果表明，该方法在这些任务上取得了最先进的性能。

相关内容

GROUP

关注 1

Group一直是研究计算机支持的合作工作、人机交互、计算机支持的协作学习和社会技术研究的主要场所。该会议将社会科学、计算机科学、工程、设计、价值观以及其他与小组工作相关的多个不同主题的工作结合起来，并进行了广泛的概念化。官网链接：https://group.acm.org/conferences/group20/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【微软研究院】IMAGEBERT: CROSS-MODAL PRE-TRAINING WITH LARGE-SCALE WEAK-SUPERVISED IMAGE-TEXT DATA

专知会员服务

43+阅读 · 2020年1月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日