ProMamba: Prompt-Mamba for polyp segmentation

Detecting polyps through colonoscopy is an important task in medical image segmentation, which provides significant assistance and reference value for clinical surgery. However, accurate segmentation of polyps is a challenging task due to two main reasons. Firstly, polyps exhibit various shapes and colors. Secondly, the boundaries between polyps and their normal surroundings are often unclear. Additionally, significant differences between different datasets lead to limited generalization capabilities of existing methods. To address these issues, we propose a segmentation model based on Prompt-Mamba, which incorporates the latest Vision-Mamba and prompt technologies. Compared to previous models trained on the same dataset, our model not only maintains high segmentation accuracy on the validation part of the same dataset but also demonstrates superior accuracy on unseen datasets, exhibiting excellent generalization capabilities. Notably, we are the first to apply the Vision-Mamba architecture to polyp segmentation and the first to utilize prompt technology in a polyp segmentation model. Our model efficiently accomplishes segmentation tasks, surpassing previous state-of-the-art methods by an average of 5% across six datasets. Furthermore, we have developed multiple versions of our model with scaled parameter counts, achieving better performance than previous models even with fewer parameters. Our code and trained weights will be released soon.

翻译：通过结肠镜检测息肉是医学图像分割中的一项重要任务，为临床手术提供了重要的辅助和参考价值。然而，由于两个主要原因，息肉的精确分割极具挑战性。首先，息肉具有多样的形态和颜色。其次，息肉与正常周围组织之间的边界往往不清晰。此外，不同数据集之间的显著差异导致现有方法的泛化能力有限。针对这些问题，我们提出了一种基于Prompt-Mamba的分割模型，该模型融合了最新的Vision-Mamba和提示技术。与以往在同一数据集上训练的模型相比，我们的模型不仅在同一数据集的验证部分保持了较高的分割精度，而且在未见过的数据集上也展现了优越的准确率，呈现出优异的泛化能力。值得注意的是，我们是首次将Vision-Mamba架构应用于息肉分割，并首次在息肉分割模型中利用提示技术。我们的模型高效地完成了分割任务，在六个数据集上平均超越了先前最先进方法5%。此外，我们还开发了参数量缩放的多个版本模型，即便在参数更少的情况下也取得了优于先前模型的性能。我们的代码和训练权重将很快发布。

相关内容

MoDELS

关注 46

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日