As an important component of the detector localization branch, bounding box regression loss plays a significant role in object detection tasks. The existing bounding box regression methods usually consider the geometric relationship between the GT box and the predicted box, and calculate the loss by using the relative position and shape of the bounding boxes, while ignoring the influence of inherent properties such as the shape and scale of the bounding boxes on bounding box regression. In order to make up for the shortcomings of existing research, this article proposes a bounding box regression method that focuses on the shape and scale of the bounding box itself. Firstly, we analyzed the regression characteristics of the bounding boxes and found that the shape and scale factors of the bounding boxes themselves will have an impact on the regression results. Based on the above conclusions, we propose the Shape IoU method, which can calculate the loss by focusing on the shape and scale of the bounding box itself, thereby making the bounding box regression more accurate. Finally, we validated our method through a large number of comparative experiments, which showed that our method can effectively improve detection performance and outperform existing methods, achieving state-of-the-art performance in different detection tasks.Code is available at https://github.com/malagoutou/Shape-IoU
翻译:作为检测器定位分支的重要组成部分,边界框回归损失在目标检测任务中扮演着关键角色。现有边界框回归方法通常考虑真实框与预测框之间的几何关系,利用边界框的相对位置和形状计算损失,却忽略了边界框自身形状与尺度等固有属性对回归过程的影响。为弥补现有研究的不足,本文提出一种聚焦于边界框自身形状与尺度的回归方法。首先,我们分析了边界框的回归特性,发现边界框自身的形状因子和尺度因子会对回归结果产生影响。基于上述结论,我们提出Shape IoU方法,该方法通过关注边界框自身的形状与尺度计算损失,从而使边界框回归更加精确。最后,通过大量对比实验验证了该方法能够有效提升检测性能,在多种检测任务中均优于现有方法,达到了最先进的性能水平。代码开源地址:https://github.com/malagoutou/Shape-IoU