Denoising Diffusion Probabilistic Model for Retinal Image Generation and Segmentation

Experts use retinal images and vessel trees to detect and diagnose various eye, blood circulation, and brain-related diseases. However, manual segmentation of retinal images is a time-consuming process that requires high expertise and is difficult due to privacy issues. Many methods have been proposed to segment images, but the need for large retinal image datasets limits the performance of these methods. Several methods synthesize deep learning models based on Generative Adversarial Networks (GAN) to generate limited sample varieties. This paper proposes a novel Denoising Diffusion Probabilistic Model (DDPM) that outperformed GANs in image synthesis. We developed a Retinal Trees (ReTree) dataset consisting of retinal images, corresponding vessel trees, and a segmentation network based on DDPM trained with images from the ReTree dataset. In the first stage, we develop a two-stage DDPM that generates vessel trees from random numbers belonging to a standard normal distribution. Later, the model is guided to generate fundus images from given vessel trees and random distribution. The proposed dataset has been evaluated quantitatively and qualitatively. Quantitative evaluation metrics include Frechet Inception Distance (FID) score, Jaccard similarity coefficient, Cohen's kappa, Matthew's Correlation Coefficient (MCC), precision, recall, F1-score, and accuracy. We trained the vessel segmentation model with synthetic data to validate our dataset's efficiency and tested it on authentic data. Our developed dataset and source code is available at https://github.com/AAleka/retree.

翻译：专家利用视网膜图像及血管树来检测和诊断多种眼部、血液循环及脑部相关疾病。然而，手动分割视网膜图像是一项耗时且需要高度专业知识的过程，同时因隐私问题而困难重重。目前已提出多种图像分割方法，但大规模视网膜图像数据集的缺乏限制了这些方法的性能。一些方法基于生成对抗网络（GAN）构建深度学习模型的合成技术，以生成有限样本类别。本文提出一种新颖的去噪扩散概率模型（DDPM），其在图像合成中表现优于生成对抗网络。我们构建了包含视网膜图像及对应血管树的视网膜树（ReTree）数据集，并基于DDPM开发了一个使用ReTree数据集图像训练的分割网络。第一阶段，我们开发了一个两阶段DDPM，用于从服从标准正态分布的随机数生成血管树。随后，引导模型从给定血管树和随机分布生成眼底图像。所提出的数据集已通过定量和定性评估。定量评估指标包括Fréchet初始距离（FID）分数、Jaccard相似系数、Cohen's kappa、马修斯相关系数（MCC）、精确率、召回率、F1分数及准确率。我们使用合成数据训练血管分割模型以验证数据集的有效性，并在真实数据上进行测试。所开发的数据集及源代码可于 https://github.com/AAleka/retree 获取。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日