Precision matters: Precision-aware ensemble for weakly supervised semantic segmentation

Weakly Supervised Semantic Segmentation (WSSS) uses weak supervision, such as image-level labels, to train a segmentation model. Despite the impressive achievements of recent WSSS methods, we identify that weak labels with a high mean Intersection over Union (mIoU) do not guarantee high segmentation performance. Existing studies have emphasized the importance of prioritizing precision and reducing noise to improve overall performance. In the same vein, we propose ORANDNet, an advanced ensemble approach tailored for WSSS. ORANDNet combines Class Activation Maps (CAMs) from two different classifiers to increase the precision of pseudo-masks (PMs). To further mitigate small noise in the PMs, we incorporate curriculum learning: the segmentation model is first trained on pairs of downsized images and their corresponding PMs, then gradually transitions to the original-sized pairs. By combining the original CAMs of ResNet-50 and ViT, we significantly improve segmentation performance over both the single best model and the naive ensemble model. We further extend our ensemble method to CAMs from AMN (ResNet-like) and MCTformer (ViT-like) models, achieving performance gains on advanced WSSS models. This highlights the potential of ORANDNet as a final add-on module for WSSS models.
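
The abstract does not spell out how ORANDNet merges the two CAMs, so the following is only a minimal sketch of the underlying intuition, not the authors' method. It assumes the CAMs are thresholded into binary masks and compares a pixel-wise union (OR) with a pixel-wise intersection (AND) on toy data, then shows one possible linear image-size schedule for the curriculum step. All function names, the threshold, the toy values, and the starting scale of 0.5 are hypothetical.

```python
def binarize(cam, threshold=0.5):
    """Turn a CAM (2D list of scores in [0, 1]) into a binary mask."""
    return [[1 if v >= threshold else 0 for v in row] for row in cam]


def combine(mask_a, mask_b, mode):
    """Pixel-wise OR (union) or AND (intersection) of two binary masks."""
    if mode not in ("or", "and"):
        raise ValueError("mode must be 'or' or 'and'")
    op = (lambda a, b: a | b) if mode == "or" else (lambda a, b: a & b)
    return [[op(a, b) for a, b in zip(ra, rb)] for ra, rb in zip(mask_a, mask_b)]


def precision_recall(pred, truth):
    """Precision and recall of a binary mask against a binary ground truth."""
    tp = sum(p & t for rp, rt in zip(pred, truth) for p, t in zip(rp, rt))
    n_pred = sum(map(sum, pred))
    n_true = sum(map(sum, truth))
    precision = tp / n_pred if n_pred else 0.0
    recall = tp / n_true if n_true else 0.0
    return precision, recall


def curriculum_scale(epoch, total_epochs, start_scale=0.5):
    """Assumed schedule: linear ramp of the resize factor from start_scale to 1.0."""
    if total_epochs <= 1:
        return 1.0
    frac = min(epoch / (total_epochs - 1), 1.0)
    return start_scale + (1.0 - start_scale) * frac


# Toy ground truth and two toy CAMs standing in for two different classifiers.
truth = [[0, 1, 1, 0],
         [0, 1, 1, 0]]
cam_a = [[0.1, 0.9, 0.8, 0.7],
         [0.2, 0.9, 0.4, 0.1]]
cam_b = [[0.6, 0.8, 0.9, 0.1],
         [0.1, 0.3, 0.9, 0.2]]

mask_a, mask_b = binarize(cam_a), binarize(cam_b)
or_mask = combine(mask_a, mask_b, "or")    # covers more pixels, keeps both models' noise
and_mask = combine(mask_a, mask_b, "and")  # keeps only pixels both models agree on

print("OR  precision/recall:", precision_recall(or_mask, truth))
print("AND precision/recall:", precision_recall(and_mask, truth))
print("resize factors:", [curriculum_scale(e, 5) for e in range(5)])
```

On this toy input the intersection discards the false positives that only one classifier produces, trading recall for precision, which is the trade-off the abstract argues is worth making for pseudo-masks. The resize factors would be applied to each image and its pseudo-mask together, so that training starts on downsized pairs and ends on original-sized ones.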

