DynaSeg: A Deep Dynamic Fusion Method for Unsupervised Image Segmentation Incorporating Feature Similarity and Spatial Continuity

Our work tackles the fundamental challenge of image segmentation in computer vision, which is crucial for diverse applications. While supervised methods demonstrate proficiency, their reliance on extensive pixel-level annotations limits scalability. In response to this challenge, we present an enhanced unsupervised Convolutional Neural Network (CNN)-based algorithm called DynaSeg. Unlike traditional approaches that rely on a fixed weight factor to balance feature similarity and spatial continuity, requiring manual adjustments, our novel, dynamic weighting scheme automates parameter tuning, adapting flexibly to image details. We also introduce the novel concept of a Silhouette Score Phase that addresses the challenge of dynamic clustering during iterations. Additionally, our methodology integrates both CNN-based and pre-trained ResNet feature extraction, offering a comprehensive and adaptable approach. We achieve state-of-the-art results on diverse datasets, with a notable 12.2% and 14.12% mIOU improvement compared to the current benchmarks on COCO-All and COCO-Stuff, respectively. The proposed approach unlocks the potential for unsupervised image segmentation and addresses scalability concerns in real-world scenarios by obviating the need for meticulous parameter tuning.

翻译：本研究聚焦于计算机视觉中图像分割这一基础性难题，该技术对诸多应用领域至关重要。虽然监督方法展现出优越性能，但其对大规模像素级标注的依赖限制了可扩展性。针对这一挑战，我们提出了一种增强型无监督卷积神经网络（CNN）算法，命名为DynaSeg。不同于传统方法采用固定权重因子来平衡特征相似性与空间连续性（需人工调整参数），我们创新的动态权重方案实现了参数自动化调优，能灵活适应图像细节特征。我们还引入了轮廓系数阶段（Silhouette Score Phase）这一新概念，解决了迭代过程中动态聚类面临的挑战。此外，本方法融合了基于CNN的特征提取与预训练ResNet特征提取，提供了一套全面而自适应的解决方案。在多个数据集上，我们的方法达到了最优性能：与当前基准方法相比，在COCO-All和COCO-Stuff数据集上的平均交并比（mIOU）分别提升了12.2%和14.12%。该方案通过消除繁琐的参数调优需求，不仅释放了无监督图像分割的潜力，也解决了实际应用场景中的可扩展性问题。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日