Artificial intelligence (AI) is increasingly being used for medical imaging tasks. However, there can be biases in AI models, particularly when they are trained using imbalanced training datasets. One such example has been the strong ethnicity bias effect in cardiac magnetic resonance (CMR) image segmentation models. Although this phenomenon has been reported in a number of publications, little is known about the effectiveness of bias mitigation algorithms in this domain. We aim to investigate the impact of common bias mitigation methods to address bias between Black and White subjects in AI-based CMR segmentation models. Specifically, we use oversampling, importance reweighing and Group DRO as well as combinations of these techniques to mitigate the ethnicity bias. Second, motivated by recent findings on the root causes of AI-based CMR segmentation bias, we evaluate the same methods using models trained and evaluated on cropped CMR images. We find that bias can be mitigated using oversampling, significantly improving performance for the underrepresented Black subjects whilst not significantly reducing the majority White subjects' performance. Using cropped images increases performance for both ethnicities and reduces the bias, whilst adding oversampling as a bias mitigation technique with cropped images reduces the bias further. When testing the models on an external clinical validation set, we find high segmentation performance and no statistically significant bias.
翻译:人工智能(AI)在医学影像任务中的应用日益广泛。然而,AI模型可能存在偏置,尤其是在使用不平衡训练数据集进行训练时。心脏磁共振(CMR)图像分割模型中存在的显著种族偏置效应即为典型案例。尽管多篇文献已报道该现象,但该领域偏置缓解算法的有效性仍缺乏深入研究。本研究旨在探究常见偏置缓解方法对解决基于AI的CMR分割模型中黑人与白人受试者间偏置的影响。具体而言,我们采用过采样、重要性重加权和群体分布鲁棒优化(Group DRO)及其组合技术来缓解种族偏置。其次,基于近期关于AI-CMR分割偏置根源的研究发现,我们使用在裁剪后CMR图像上训练和评估的模型对相同方法进行验证。研究发现:过采样可有效缓解偏置,显著提升代表性不足的黑人受试者性能,同时未显著降低占多数的白人受试者性能;使用裁剪图像可提升两种族群的性能并降低偏置,而结合裁剪图像与过采样技术能进一步减少偏置。在外部临床验证集测试中,模型展现出高分割性能且未出现统计学显著偏置。