Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis

from arxiv, This document is a preprint and has been accepted for publication in the IEEE Journal of Biomedical and Health Informatics. The final, published version can be accessed using the following DOI: 10.1109/JBHI.2024.3385504. Copyright for this article has been transferred to IEEE

Artificial intelligence (AI) in healthcare, especially in medical imaging, faces challenges due to data scarcity and privacy concerns. Addressing these, we introduce Med-DDPM, a diffusion model designed for 3D semantic brain MRI synthesis. This model effectively tackles data scarcity and privacy issues by integrating semantic conditioning. This involves the channel-wise concatenation of a conditioning image to the model input, enabling control in image generation. Med-DDPM demonstrates superior stability and performance compared to existing 3D brain imaging synthesis methods. It generates diverse, anatomically coherent images with high visual fidelity. In terms of dice score accuracy in the tumor segmentation task, Med-DDPM achieves 0.6207, close to the 0.6531 accuracy of real images, and outperforms baseline models. Combined with real images, it further increases segmentation accuracy to 0.6675, showing the potential of our proposed method for data augmentation. This model represents the first use of a diffusion model in 3D semantic brain MRI synthesis, producing high-quality images. Its semantic conditioning feature also shows potential for image anonymization in biomedical imaging, addressing data and privacy issues. We provide the code and model weights for Med-DDPM on our GitHub repository (https://github.com/mobaidoctor/med-ddpm/) to support reproducibility.

翻译：医疗领域的人工智能，尤其是医学影像分析，常受数据稀缺与隐私问题困扰。为此，我们提出Med-DDPM——一种专为三维语义脑MRI合成设计的扩散模型。该模型通过整合语义条件生成，有效缓解数据与隐私难题：将条件图像按通道维度拼接至模型输入，实现生成过程的精准控制。与现有三维脑影像合成方法相比，Med-DDPM展现出卓越的稳定性与性能，可生成多样化、解剖结构一致且视觉保真度高的图像。在肿瘤分割任务的Dice系数评估中，Med-DDPM达0.6207，逼近真实图像的0.6531，并显著优于基线模型。将该模型与真实图像联合使用时，分割精度进一步提升至0.6675，充分彰显本方法在数据增强领域的潜力。作为首个将扩散模型应用于三维语义脑MRI合成的研究，Med-DDPM不仅能生成高质量图像，其语义条件特性更在生物医学图像匿名化方面展现重要价值，有效应对数据与隐私挑战。为支持研究可复现性，我们已在GitHub仓库（https://github.com/mobaidoctor/med-ddpm/）公开Med-DDPM代码与模型权重。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/