In the field of medical sciences, reliable detection and classification of brain tumors from images remains a formidable challenge due to the rarity of tumors within the population of patients. Therefore, the ability to detect tumors in anomaly scenarios is paramount for ensuring timely interventions and improved patient outcomes. This study addresses the issue by leveraging deep learning (DL) techniques to detect and classify brain tumors in challenging situations. The curated data set from the National Brain Mapping Lab (NBML) comprises 81 patients, including 30 Tumor cases and 51 Normal cases. The detection and classification pipelines are separated into two consecutive tasks. The detection phase involved comprehensive data analysis and pre-processing to modify the number of image samples and the number of patients of each class to anomaly distribution (9 Normal per 1 Tumor) to comply with real world scenarios. Next, in addition to common evaluation metrics for the testing, we employed a novel performance evaluation method called Patient to Patient (PTP), focusing on the realistic evaluation of the model. In the detection phase, we fine-tuned a YOLOv8n detection model to detect the tumor region. Subsequent testing and evaluation yielded competitive performance both in Common Evaluation Metrics and PTP metrics. Furthermore, using the Data Efficient Image Transformer (DeiT) module, we distilled a Vision Transformer (ViT) model from a fine-tuned ResNet152 as a teacher in the classification phase. This approach demonstrates promising strides in reliable tumor detection and classification, offering potential advancements in tumor diagnosis for real-world medical imaging scenarios.
翻译:在医学科学领域,由于肿瘤在患者群体中的罕见性,从图像中可靠地检测和分类脑肿瘤仍然是一项艰巨挑战。因此,在异常场景下检测肿瘤的能力对于确保及时干预和改善患者预后至关重要。本研究通过利用深度学习技术来解决这一问题,旨在在具有挑战性的情况下检测和分类脑肿瘤。来自国家脑图谱实验室的精选数据集包含81名患者,其中30例为肿瘤病例,51例为正常病例。检测与分类流程被分为两个连续任务。在检测阶段,我们进行了全面的数据分析和预处理,将图像样本数量及各类别患者数量调整为异常分布(每1例肿瘤对应9例正常样本),以符合真实世界场景。此外,除了常规测试评估指标,我们采用了一种名为"患者对患者"的新型性能评估方法,专注于模型的实际应用评估。在检测阶段,我们微调了YOLOv8n检测模型以定位肿瘤区域。后续测试与评估在常规指标和PTP指标上均展现出优异性能。进一步地,在分类阶段,我们利用数据高效图像Transformer模块,以微调后的ResNet152作为教师模型,蒸馏训练出视觉Transformer分类模型。该方法在可靠肿瘤检测与分类方面展现出显著进展,为真实世界医学影像场景中的肿瘤诊断提供了潜在的技术突破。