Generalization capabilities of learning-based medical image segmentation across domains are currently limited by the performance degradation caused by the domain shift, particularly for ultrasound (US) imaging. The quality of US images heavily relies on carefully tuned acoustic parameters, which vary across sonographers, machines, and settings. To improve the generalizability on US images across domains, we propose MI-SegNet, a novel mutual information (MI) based framework to explicitly disentangle the anatomical and domain feature representations; therefore, robust domain-independent segmentation can be expected. Two encoders are employed to extract the relevant features for the disentanglement. The segmentation only uses the anatomical feature map for its prediction. In order to force the encoders to learn meaningful feature representations a cross-reconstruction method is used during training. Transformations, specific to either domain or anatomy are applied to guide the encoders in their respective feature extraction task. Additionally, any MI present in both feature maps is punished to further promote separate feature spaces. We validate the generalizability of the proposed domain-independent segmentation approach on several datasets with varying parameters and machines. Furthermore, we demonstrate the effectiveness of the proposed MI-SegNet serving as a pre-trained model by comparing it with state-of-the-art networks.
翻译:基于学习的医学图像分割在跨领域场景下的泛化能力目前受限于领域偏移导致的性能下降,尤其在超声成像领域。超声图像质量高度依赖于精细调节的声学参数,这些参数会因操作者、设备和设置的不同而变化。为提升超声图像在跨领域场景下的泛化性能,我们提出MI-SegNet——一种基于互信息的新型框架,通过显式解耦解剖特征与领域特征表示,从而实现鲁棒的领域无关分割。该框架采用两个编码器分别提取解耦所需的相关特征,分割任务仅使用解剖特征图进行预测。为迫使编码器学习有意义的特征表示,训练过程中采用交叉重建方法。通过分别施加领域特异性与解剖特异性变换,引导编码器完成各自的特征提取任务。此外,两个特征图中存在的互信息将被惩罚,以进一步促进特征空间的分离。我们在包含不同参数与设备的多个数据集上验证了所提出的领域无关分割方法的泛化能力,并通过与当前最优网络的对比,证明了MI-SegNet作为预训练模型的有效性。