Weakly Supervised Semantic Segmentation (WSSS) relying only on image-level supervision is a promising approach to deal with the need for Segmentation networks, especially for generating a large number of pixel-wise masks in a given dataset. However, most state-of-the-art image-level WSSS techniques lack an understanding of the geometric features embedded in the images since the network cannot derive any object boundary information from just image-level labels. We define a boundary here as the line separating an object and its background, or two different objects. To address this drawback, we are proposing our novel ReFit framework, which deploys state-of-the-art class activation maps combined with various post-processing techniques in order to achieve fine-grained higher-accuracy segmentation masks. To achieve this, we investigate a state-of-the-art unsupervised segmentation network that can be used to construct a boundary map, which enables ReFit to predict object locations with sharper boundaries. By applying our method to WSSS predictions, we achieved up to 10% improvement over the current state-of-the-art WSSS methods for medical imaging. The framework is open-source, to ensure that our results are reproducible, and accessible online at https://github.com/bharathprabakaran/ReFit.
翻译:仅依赖图像级标签的弱监督语义分割(WSSS)是应对分割网络需求的有效方法,尤其适用于为特定数据集生成大量像素级掩码。然而,当前最先进的图像级WSSS技术缺乏对图像中嵌入的几何特征的理解,因为网络无法仅从图像级标签中获取对象边界信息(此处边界定义为对象与其背景、或两个不同对象的分隔线)。为解决这一缺陷,我们提出创新性ReFit框架,该框架结合最先进的类激活图与多种后处理技术,以实现细粒度、高精度的分割掩码。为此,我们探索了一种可用于构建边界图的最先进无监督分割网络,使ReFit能够以更清晰的边界预测对象位置。将我们的方法应用于WSSS预测后,在医学图像领域较当前最先进WSSS方法实现了高达10%的性能提升。本框架为开源项目,以确保结果可复现,代码已发布于https://github.com/bharathprabakaran/ReFit。