Active learning is considered a viable solution to alleviate the contradiction between the high dependency of deep learning-based segmentation methods on annotated data and the expensive pixel-level annotation cost of medical images. However, most existing methods suffer from unreliable uncertainty assessment and the struggle to balance diversity and informativeness, leading to poor performance in segmentation tasks. In response, we propose an efficient Predictive Accuracy-based Active Learning (PAAL) method for medical image segmentation, first introducing predictive accuracy to define uncertainty. Specifically, PAAL mainly consists of an Accuracy Predictor (AP) and a Weighted Polling Strategy (WPS). The former is an attached learnable module that can accurately predict the segmentation accuracy of unlabeled samples relative to the target model with the predicted posterior probability. The latter provides an efficient hybrid querying scheme by combining predicted accuracy and feature representation, aiming to ensure the uncertainty and diversity of the acquired samples. Extensive experiment results on multiple datasets demonstrate the superiority of PAAL. PAAL achieves comparable accuracy to fully annotated data while reducing annotation costs by approximately 50% to 80%, showcasing significant potential in clinical applications. The code is available at https://github.com/shijun18/PAAL-MedSeg.
翻译:主动学习被认为是缓解基于深度学习的图像分割方法对标注数据的高度依赖与医学图像像素级标注成本高昂之间矛盾的一种可行解决方案。然而,现有大多数方法存在不确定性评估不可靠、难以在多样性与信息量之间取得平衡的问题,导致分割任务性能不佳。为此,我们提出一种高效的基于预测准确率的主动学习方法(PAAL)用于医学图像分割,首次引入预测准确率来定义不确定性。具体而言,PAAL主要由准确率预测器(AP)和加权投票策略(WPS)构成。前者是一个附加的可学习模块,能够利用预测的后验概率准确估计未标注样本相对于目标模型的分割准确率;后者通过结合预测准确率与特征表示,提供高效的混合查询方案,旨在确保获取样本的不确定性与多样性。在多个数据集上的大量实验结果表明,PAAL具有优越性能。PAAL在将标注成本降低约50%至80%的同时,实现了与全标注数据相当的准确率,展现了显著的临床应用潜力。代码已开源至 https://github.com/shijun18/PAAL-MedSeg。