Visual object tracking plays a critical role in vision-based autonomous systems, as it aims to estimate the position and size of an object of interest within a live video. Despite significant progress in this field, state-of-the-art (SOTA) trackers often fail when faced with adversarial perturbations in the incoming frames, which raises serious robustness and security concerns when these trackers are deployed in the real world. To achieve high accuracy on both clean and adversarial data, we propose building a spatial-temporal continuous representation guided by the semantic text description of the object of interest. This novel continuous representation enables us to reconstruct incoming frames so that they remain semantically and visually consistent with the object of interest and with their clean counterparts. As a result, our proposed method successfully defends against diverse SOTA adversarial tracking attacks while maintaining high accuracy on clean data. In particular, our method increases tracking accuracy under adversarial attacks by around 90% (relative) on UAV123, yielding accuracy even higher than that on clean data.
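The defense described above can be viewed as a preprocessing step: each incoming frame is reconstructed before it reaches the tracker, so that adversarial perturbations are removed while the appearance of the target is preserved. The following is a minimal NumPy sketch of that pipeline shape only; the learned, text-guided spatial-temporal continuous representation of the paper is replaced here by a simple local-averaging reconstruction, and all function names (`reconstruct_frame`, `defended_track`, the `smoothing` knob) are illustrative assumptions, not the authors' API.

```python
import numpy as np

def reconstruct_frame(frame, smoothing=1):
    """Stand-in for the learned continuous-representation reconstruction.

    Here we just apply iterative 5-point local averaging, which suppresses
    high-frequency (adversarial-style) perturbations while keeping the
    smooth image content. `smoothing` is a hypothetical knob for this
    sketch, not a parameter from the paper.
    """
    out = frame.astype(np.float64)
    for _ in range(smoothing):
        padded = np.pad(out, ((1, 1), (1, 1)), mode="edge")
        out = (padded[:-2, 1:-1] + padded[2:, 1:-1] +
               padded[1:-1, :-2] + padded[1:-1, 2:] + out) / 5.0
    return out

def defended_track(tracker_step, frames):
    """Run an arbitrary per-frame tracker on reconstructed frames
    instead of the raw (possibly attacked) ones."""
    return [tracker_step(reconstruct_frame(f, smoothing=3)) for f in frames]

# Demo: a smooth "clean" frame plus high-frequency sign noise,
# mimicking an L-infinity-bounded adversarial perturbation.
rng = np.random.default_rng(0)
clean = np.linspace(0.0, 1.0, 64 * 64).reshape(64, 64)
noise = 0.2 * rng.choice([-1.0, 1.0], size=clean.shape)
adv = clean + noise

rec = reconstruct_frame(adv, smoothing=3)
err_before = np.abs(adv - clean).mean()
err_after = np.abs(rec - clean).mean()
print(err_before, err_after)  # reconstruction moves the frame back toward the clean one
```

The point of the sketch is the interface, not the filter: any tracker can be wrapped by `defended_track` without retraining, which mirrors how a reconstruction-based defense is applied in front of an off-the-shelf SOTA tracker.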