Integrating high-level semantically correlated contents and low-level anatomical features is of central importance in medical image segmentation. Towards this end, recent deep learning-based medical segmentation methods have shown great promise in better modeling such information. However, convolution operators for medical segmentation typically operate on regular grids, which inherently blur the high-frequency regions, i.e., boundary regions. In this work, we propose MORSE, a generic implicit neural rendering framework designed at an anatomical level to assist learning in medical image segmentation. Our method is motivated by the fact that implicit neural representation has been shown to be more effective in fitting complex signals and solving computer graphics problems than discrete grid-based representation. The core of our approach is to formulate medical image segmentation as a rendering problem in an end-to-end manner. Specifically, we continuously align the coarse segmentation prediction with the ambiguous coordinate-based point representations and aggregate these features to adaptively refine the boundary region. To parallelly optimize multi-scale pixel-level features, we leverage the idea from Mixture-of-Expert (MoE) to design and train our MORSE with a stochastic gating mechanism. Our experiments demonstrate that MORSE can work well with different medical segmentation backbones, consistently achieving competitive performance improvements in both 2D and 3D supervised medical segmentation methods. We also theoretically analyze the superiority of MORSE.
翻译:整合高层语义相关内容与低层解剖特征在医学图像分割中至关重要。为此,基于深度学习的现代医学分割方法在更优建模此类信息方面展现出巨大潜力。然而,用于医学分割的卷积算子通常在规则网格上运行,这本质会导致高频区域(即边界区域)的模糊。本文提出MORSE——一种在解剖层面设计的通用隐式神经渲染框架,旨在辅助医学图像分割学习。我们的方法受以下事实启发:相较于离散网格表示,隐式神经表示在拟合复杂信号和解决计算机图形学问题中已被证明更有效。该框架核心在于将医学图像分割端到端地形式化为渲染问题:具体而言,我们持续将粗分割预测与模糊的基于坐标的点表示对齐,并通过聚合这些特征自适应优化边界区域。为并行优化多尺度像素级特征,我们借鉴混合专家(MoE)思想,通过随机门控机制设计并训练MORSE。实验表明,MORSE能够与不同医学分割骨干网络良好兼容,在2D和3D有监督医学分割方法中均持续取得竞争性的性能提升。同时,本文从理论上分析了MORSE的优越性。