This paper presents a new dataset and general tracker enhancement method for Underwater Visual Object Tracking (UVOT). Despite its significance, underwater tracking has remained unexplored due to data inaccessibility. It poses distinct challenges; the underwater environment exhibits non-uniform lighting conditions, low visibility, lack of sharpness, low contrast, camouflage, and reflections from suspended particles. Performance of traditional tracking methods designed primarily for terrestrial or open-air scenarios drops in such conditions. We address the problem by proposing a novel underwater image enhancement algorithm designed specifically to boost tracking quality. The method has resulted in a significant performance improvement, of up to 5.0% AUC, of state-of-the-art (SOTA) visual trackers. To develop robust and accurate UVOT methods, large-scale datasets are required. To this end, we introduce a large-scale UVOT benchmark dataset consisting of 400 video segments and 275,000 manually annotated frames enabling underwater training and evaluation of deep trackers. The videos are labelled with several underwater-specific tracking attributes including watercolor variation, target distractors, camouflage, target relative size, and low visibility conditions. The UVOT400 dataset, tracking results, and the code are publicly available on: https://github.com/BasitAlawode/UWVOT400.
翻译:本文提出了一种新的数据集和通用追踪器增强方法,用于水下视觉目标追踪(Underwater Visual Object Tracking, UVOT)。尽管水下追踪具有重要意义,但由于数据难以获取,该领域一直未得到充分探索。水下环境存在诸多独特挑战,包括不均匀光照条件、低可见度、缺乏清晰度、低对比度、伪装效应以及悬浮颗粒的反射。主要针对陆地或露天场景设计的传统追踪方法在此类条件下性能显著下降。我们通过提出一种专门用于提升追踪质量的新型水下图像增强算法来解决该问题。该方法使最先进(State-of-the-Art, SOTA)视觉追踪器的性能显著提升,AUC指标最高提高5.0%。为开发稳健且准确的UVOT方法,需要大规模数据集。为此,我们引入了一个大规模UVOT基准数据集,包含400个视频片段和27.5万张人工标注帧,可支持深度追踪器的水下训练与评估。视频按多种水下特定追踪属性进行标注,包括水色变化、目标干扰物、伪装、目标相对尺寸及低可见度条件。UVOT400数据集、追踪结果及代码已公开于:https://github.com/BasitAlawode/UWVOT400。