Clinically Aware Synthetic Image Generation for Concept Coverage in Chest X-ray Models

Deep learning models for chest X-ray diagnosis are constrained by limited coverage of clinically meaningful concept combinations in publicly available training datasets. While synthetic image generation has been explored to increase data diversity, existing methods rarely enforce clinical or anatomical constraints, limiting utility for improving model reliability. We propose CARPA, a clinically aware and anatomically grounded framework for synthetic chest X-ray generation that applies targeted perturbations to clinical concept vectors while preserving anatomical structure. By producing anatomically faithful synthetic images with controlled concept insertions and deletions, CARPA expands clinically relevant concept coverage. We evaluate CARPA across seven backbone architectures by fine-tuning models on synthetic subsets and testing on a held-out MIMIC-CXR benchmark. Compared to prior concept perturbation approaches, fine-tuning on CARPA-generated images consistently improves precision-recall performance, reduces predictive uncertainty, and improves model calibration. Structural and semantic analyses demonstrate high anatomical fidelity, strong concept alignment, and low semantic uncertainty. Evaluation by two expert radiologists further confirms realism and clinical agreement. Together, these results show that anatomically grounded concept perturbations enable more effective use of synthetic data, improving both performance and reliability of chest X-ray classification models and supporting safer clinical deployment.

翻译：用于胸部X光诊断的深度学习模型受到公开训练数据集中临床有意义概念组合覆盖范围有限的制约。尽管已有研究探索通过合成图像生成增强数据多样性，但现有方法很少施加临床或解剖约束，限制了其在提升模型可靠性方面的实用性。我们提出CARPA——一种具有临床感知与解剖基础框架的合成胸部X光图像生成方法，该方法在保留解剖结构的同时，对临床概念向量施加目标性扰动。通过生成保留解剖真实性且包含受控的概念插入与删除操作的合成图像，CARPA扩展了临床相关概念的覆盖范围。我们基于七种骨干架构对CARPA进行评估：在合成子集上微调模型，并在保留的MIMIC-CXR基准上进行测试。与先前的概念扰动方法相比，基于CARPA生成图像进行微调，能够持续提升精确率-召回率性能、降低预测不确定性，并改善模型校准效果。结构与语义分析表明，该方法具有高解剖保真度、强概念对齐度以及低语义不确定性。两位放射科专家的评估进一步证实了其真实感与临床一致性。上述结果共同表明，基于解剖基础的概念扰动能够更有效地利用合成数据，从而提升胸部X光分类模型的性能与可靠性，并支持更安全的临床部署。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【博士论文】结合图像与文本以提升医学图像理解

专知会员服务

30+阅读 · 2025年3月1日

【伦敦国王学院博士论文】可信深度学习医学图像分割，270页pdf

专知会员服务

44+阅读 · 2023年6月1日

【CVPR2023】基于动态图增强对比学习的胸部X光报告生成

专知会员服务

21+阅读 · 2023年3月23日

港科大浙大最新《深度生成模型三维表示》综述，20页pdf全面阐述3D生成进展

专知会员服务

47+阅读 · 2022年10月31日