Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT

from arxiv, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

In the field of medical sciences, reliable detection and classification of brain tumors from images remains a formidable challenge due to the rarity of tumors within the population of patients. Therefore, the ability to detect tumors in anomaly scenarios is paramount for ensuring timely interventions and improved patient outcomes. This study addresses the issue by leveraging deep learning (DL) techniques to detect and classify brain tumors in challenging situations. The curated data set from the National Brain Mapping Lab (NBML) comprises 81 patients, including 30 Tumor cases and 51 Normal cases. The detection and classification pipelines are separated into two consecutive tasks. The detection phase involved comprehensive data analysis and pre-processing to modify the number of image samples and the number of patients of each class to anomaly distribution (9 Normal per 1 Tumor) to comply with real world scenarios. Next, in addition to common evaluation metrics for the testing, we employed a novel performance evaluation method called Patient to Patient (PTP), focusing on the realistic evaluation of the model. In the detection phase, we fine-tuned a YOLOv8n detection model to detect the tumor region. Subsequent testing and evaluation yielded competitive performance both in Common Evaluation Metrics and PTP metrics. Furthermore, using the Data Efficient Image Transformer (DeiT) module, we distilled a Vision Transformer (ViT) model from a fine-tuned ResNet152 as a teacher in the classification phase. This approach demonstrates promising strides in reliable tumor detection and classification, offering potential advancements in tumor diagnosis for real-world medical imaging scenarios.

翻译：在医学科学领域，由于肿瘤在患者群体中的罕见性，从医学图像中可靠地检测和分类脑肿瘤仍是一项严峻挑战。因此，在异常场景中检测肿瘤的能力对于确保及时干预和改善患者预后至关重要。本研究利用深度学习技术解决这一挑战，在困难情境下检测和分类脑肿瘤。从国家脑图谱实验室（NBML）整理的数据集包含81例患者，包括30例肿瘤病例和51例正常病例。检测与分类流程被分为两个连续任务。检测阶段涉及全面的数据分析和预处理，通过调整每类图像样本数和患者数，使其符合异常分布（每9例正常对应1例肿瘤），以模拟真实世界场景。此外，除常规测试评估指标外，我们采用了一种名为"患者对患者（Patient to Patient，PTP）"的新型性能评估方法，聚焦于模型的现实评估。在检测阶段，我们微调了YOLOv8n检测模型以定位肿瘤区域。后续测试与评估在常规评估指标和PTP指标上均取得具有竞争力的表现。进一步地，利用数据高效图像Transformer模块，我们在分类阶段以微调后的ResNet152作为教师模型，蒸馏出视觉Transformer（ViT）模型。该方法在可靠肿瘤检测与分类方面展现出可喜进展，为真实医学影像场景中的肿瘤诊断提供了潜在的进步方向。