MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation

Efficient polyp segmentation in healthcare plays a critical role in enabling early diagnosis of colorectal cancer. However, the segmentation of polyps presents numerous challenges, including the intricate distribution of backgrounds, variations in polyp sizes and shapes, and indistinct boundaries. Defining the boundary between the foreground (i.e. polyp itself) and the background (surrounding tissue) is difficult. To mitigate these challenges, we propose Multi-Scale Edge-Guided Attention Network (MEGANet) tailored specifically for polyp segmentation within colonoscopy images. This network draws inspiration from the fusion of a classical edge detection technique with an attention mechanism. By combining these techniques, MEGANet effectively preserves high-frequency information, notably edges and boundaries, which tend to erode as neural networks deepen. MEGANet is designed as an end-to-end framework, encompassing three key modules: an encoder, which is responsible for capturing and abstracting the features from the input image, a decoder, which focuses on salient features, and the Edge-Guided Attention module (EGA) that employs the Laplacian Operator to accentuate polyp boundaries. Extensive experiments, both qualitative and quantitative, on five benchmark datasets, demonstrate that our EGANet outperforms other existing SOTA methods under six evaluation metrics. Our code is available at \url{https://github.com/UARK-AICV/MEGANet}.

翻译：在医疗保健中，高效的息肉分割对于结直肠癌的早期诊断至关重要。然而，息肉分割面临诸多挑战，包括背景分布复杂、息肉大小和形状多变以及边界模糊。定义前景（即息肉本身）与背景（周围组织）之间的边界十分困难。为应对这些挑战，我们提出了一种专门用于结肠镜图像中息肉分割的多尺度边缘引导注意力网络（MEGANet）。该网络受经典边缘检测技术与注意力机制融合的启发。通过结合这些技术，MEGANet有效保留了高频信息，尤其是边缘和边界——这些信息会随神经网络深度增加而逐渐丢失。MEGANet设计为端到端框架，包含三个关键模块：编码器（负责从输入图像中捕获和抽象特征）、解码器（聚焦于显著特征）以及边缘引导注意力模块（EGA），该模块利用拉普拉斯算子强化息肉边界。在五个基准数据集上进行的定性和定量实验表明，我们的MEGANet在六项评估指标下优于现有其他最先进方法。我们的代码已开源在\url{https://github.com/UARK-AICV/MEGANet}。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日