End-to-end autoencoding architecture for the simultaneous generation of medical images and corresponding segmentation masks

Despite the increasing use of deep learning in medical image segmentation, acquiring sufficient training data remains a challenge in the medical field. Data augmentation techniques have been proposed in response; however, generating diverse and realistic medical images together with their corresponding masks remains difficult, especially when only small training sets are available. To address these limitations, we present an end-to-end architecture based on the Hamiltonian Variational Autoencoder (HVAE). This approach yields a better approximation of the posterior distribution than traditional Variational Autoencoders (VAEs), resulting in higher image generation quality. Our method outperforms generative adversarial architectures under data-scarce conditions, improving both image quality and the precision of the synthesized tumor masks. We conduct experiments on two publicly available datasets, from MICCAI's Brain Tumor Segmentation Challenge (BRATS) and the Head and Neck Tumor Segmentation Challenge (HECKTOR), demonstrating the effectiveness of our method on different medical imaging modalities.
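As a rough illustration of the mechanism behind the HVAE, the sketch below shows the leapfrog integrator that Hamiltonian methods use to move an initial latent sample deterministically under the gradient of a target density. It is not the authors' implementation. The 1-D Gaussian target, the function names, the step size, and the number of steps are all assumptions chosen for readability; an actual HVAE would use the gradient of the model's joint log-density and typically adds a tempering schedule.

```python
# Toy sketch (assumed setup, not the paper's code): leapfrog integration on a
# 1-D Gaussian target standing in for the posterior over a latent variable z.

def potential(z, mu=2.0, sigma=0.5):
    # U(z) = -log p(z) up to a constant, for a Gaussian with mean mu and std sigma.
    return 0.5 * ((z - mu) / sigma) ** 2

def grad_potential(z, mu=2.0, sigma=0.5):
    # dU/dz for the Gaussian potential above.
    return (z - mu) / sigma ** 2

def leapfrog(z, rho, step_size=0.05, n_steps=10):
    """Evolve position z and momentum rho with n_steps leapfrog updates."""
    for _ in range(n_steps):
        rho = rho - 0.5 * step_size * grad_potential(z)  # half step in momentum
        z = z + step_size * rho                          # full step in position
        rho = rho - 0.5 * step_size * grad_potential(z)  # half step in momentum
    return z, rho

def hamiltonian(z, rho):
    # Total energy: potential plus kinetic energy with unit mass.
    return potential(z) + 0.5 * rho ** 2

# Start from a hypothetical encoder sample z0 with an auxiliary momentum rho0.
z0, rho0 = 0.0, 1.0
z1, rho1 = leapfrog(z0, rho0)
h0, h1 = hamiltonian(z0, rho0), hamiltonian(z1, rho1)

# Leapfrog is time-reversible: flipping the momentum and integrating again
# returns to the starting position (up to floating-point error).
z_back, rho_back = leapfrog(z1, -rho1)
```

The two properties checked here, approximate conservation of the Hamiltonian and reversibility, are what make the leapfrog map usable as a deterministic, volume-preserving transformation of the latent sample.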


