Digital pathology has recently been revolutionized by advancements in artificial intelligence, deep learning, and high-performance computing. With its advanced tools, digital pathology can help improve and speed up the diagnostic process, reduce human errors, and streamline the reporting step. In this paper, we report a new large red blood cell (RBC) image dataset and propose a two-stage deep learning framework for RBC image segmentation and classification. The dataset is a highly diverse dataset of more than 100K RBCs containing eight different classes. The dataset, which is considerably larger than any publicly available hematopathology dataset, was labeled independently by two hematopathologists who also manually created masks for RBC cell segmentation. Subsequently, in the proposed framework, first, a U-Net model was trained to achieve automatic RBC image segmentation. Second, an EfficientNetB0 model was trained to classify RBC images into one of the eight classes using a transfer learning approach with a 5X2 cross-validation scheme. An IoU of 98.03% and an average classification accuracy of 96.5% were attained on the test set. Moreover, we have performed experimental comparisons against several prominent CNN models. These comparisons show the superiority of the proposed model with a good balance between performance and computational cost.
翻译:数字病理学近年来因人工智能、深度学习及高性能计算的进步而实现革新。借助其先进工具,数字病理学能够改善并加速诊断流程、减少人为错误,并简化报告环节。本文报告了一个新的大型红细胞图像数据集,并提出一种用于红细胞图像分割与分类的两阶段深度学习框架。该数据集包含超过10万个红细胞,涵盖八种不同类别,规模远超任何公开的血液病理学数据集。数据由两位血液病理学家独立标注,并手工创建了红细胞分割掩膜。随后在提出的框架中,首先训练U-Net模型实现自动红细胞图像分割,其次采用迁移学习策略,通过5×2交叉验证方案训练EfficientNetB0模型,将红细胞图像分为八类。测试集上交并比达98.03%,平均分类准确率达96.5%。此外,我们与多个主流卷积神经网络模型进行了实验对比,结果表明所提模型在性能与计算成本之间达到了良好平衡,具有显著优越性。