You Only Look Once (YOLO)-based object detectors have shown remarkable accuracy for automated brain tumor detection. In this paper, we develop a novel BGF-YOLO architecture by incorporating Bi-level routing attention, Generalized feature pyramid networks, and Fourth detecting head into YOLOv8. BGF-YOLO contains an attention mechanism to focus more on important features, and feature pyramid networks to enrich feature representation by merging high-level semantic features with spatial details. Furthermore, we investigate the effect of different attention mechanisms and feature fusions, detection head architectures on brain tumor detection accuracy. Experimental results show that BGF-YOLO gives a 4.7% absolute increase of mAP$_{50}$ compared to YOLOv8x, and achieves state-of-the-art on the brain tumor detection dataset Br35H. The code is available at https://github.com/mkang315/BGF-YOLO.
翻译:基于YOLO的目标检测器在自动化脑肿瘤检测中展现出卓越的准确性。本文通过将双层路由注意力机制、广义特征金字塔网络和第四检测头融入YOLOv8,提出了一种新颖的BGF-YOLO架构。BGF-YOLO包含注意力机制以更聚焦于重要特征,并通过融合高层语义特征与空间细节的特征金字塔网络来丰富特征表示。此外,我们研究了不同注意力机制、特征融合方式及检测头架构对脑肿瘤检测精度的影响。实验结果表明,与YOLOv8x相比,BGF-YOLO在mAP$_{50}$指标上实现了4.7%的绝对提升,并在脑肿瘤检测数据集Br35H上达到了最先进的性能。代码公开于https://github.com/mkang315/BGF-YOLO。