Synthetic data generated by generative models can enhance the performance and capabilities of data-hungry deep learning models in medical imaging. However, there is (1) limited availability of (synthetic) datasets and (2) generative models are complex to train, which hinders their adoption in research and clinical applications. To reduce this entry barrier, we propose medigan, a one-stop shop for pretrained generative models implemented as an open-source framework-agnostic Python library. medigan allows researchers and developers to create, increase, and domain-adapt their training data in just a few lines of code. Guided by design decisions based on gathered end-user requirements, we implement medigan based on modular components for generative model (i) execution, (ii) visualisation, (iii) search & ranking, and (iv) contribution. The library's scalability and design is demonstrated by its growing number of integrated and readily-usable pretrained generative models consisting of 21 models utilising 9 different Generative Adversarial Network architectures trained on 11 datasets from 4 domains, namely, mammography, endoscopy, x-ray, and MRI. Furthermore, 3 applications of medigan are analysed in this work, which include (a) enabling community-wide sharing of restricted data, (b) investigating generative model evaluation metrics, and (c) improving clinical downstream tasks. In (b), extending on common medical image synthesis assessment and reporting standards, we show Fr\'echet Inception Distance variability based on image normalisation and radiology-specific feature extraction.
翻译:生成模型生成的合成数据可以提升医学影像中数据密集型的深度学习模型的性能和能力。然而,目前存在(1)合成数据集的可用性有限,以及(2)生成模型训练复杂的问题,这阻碍了它们在研究和临床中的应用。为降低这一门槛,我们提出medigan,这是一个一站式预训练生成模型库,实现为开源且框架无关的Python库。medigan使研究人员和开发者仅需几行代码即可创建、扩充和领域适配其训练数据。基于收集的终端用户需求的设计决策,我们通过模块化组件实现了medigan的生成模型(i)执行、(ii)可视化、(iii)搜索与排序,以及(iv)贡献功能。该库的可扩展性和设计通过其集成且可立即使用的预训练生成模型数量的增长得到验证,其中包括21个模型,这些模型使用了9种不同的生成对抗网络架构,在来自4个领域(即乳腺摄影、内窥镜、X射线和MRI)的11个数据集上进行了训练。此外,本文分析了medigan的3种应用,包括(a)实现受限数据的社区共享、(b)研究生成模型评估指标,以及(c)改善临床下游任务。在(b)中,我们扩展了常见的医学图像合成评估和报告标准,展示了基于图像归一化和放射学特定特征提取的Fréchet初始距离变异。