ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation

Since real-world machine systems are running in non-stationary environments, Continual Test-Time Adaptation (CTTA) task is proposed to adapt the pre-trained model to continually changing target domains. Recently, existing methods mainly focus on model-based adaptation, which aims to leverage a self-training manner to extract the target domain knowledge. However, pseudo labels can be noisy and the updated model parameters are unreliable under dynamic data distributions, leading to error accumulation and catastrophic forgetting in the continual adaptation process. To tackle these challenges and maintain the model plasticity, we design a Visual Domain Adapter (ViDA) for CTTA, explicitly handling both domain-specific and domain-shared knowledge. Specifically, we first comprehensively explore the different domain representations of the adapters with trainable high-rank or low-rank embedding spaces. Then we inject ViDAs into the pre-trained model, which leverages high-rank and low-rank features to adapt the current domain distribution and maintain the continual domain-shared knowledge, respectively. To exploit the low-rank and high-rank ViDAs more effectively, we further propose a Homeostatic Knowledge Allotment (HKA) strategy, which adaptively combines different knowledge from each ViDA. Extensive experiments conducted on four widely used benchmarks demonstrate that our proposed method achieves state-of-the-art performance in both classification and segmentation CTTA tasks. Note that, our method can be regarded as a novel transfer paradigm for large-scale models, delivering promising results in adaptation to continually changing distributions. Project page: https://sites.google.com/view/iclr2024-vida/home.

翻译：现实世界中的机器系统运行在非平稳环境下，因此提出持续测试时适应任务，旨在使预训练模型适应持续变化的目标域。现有方法主要聚焦于基于模型的适应，通过自训练方式提取目标域知识。然而，在动态数据分布下，伪标签可能存在噪声，且更新的模型参数不可靠，导致持续适应过程中出现误差累积与灾难性遗忘。为解决上述挑战并保持模型可塑性，我们为CTTA设计了视觉域适配器，显式处理域特定知识与域共享知识。具体而言，我们首先全面探索了具有可训练高秩或低秩嵌入空间的适配器所呈现的不同域表示；随后将ViDA注入预训练模型，分别利用高秩与低秩特征适配当前域分布并维持持续域共享知识。为更有效利用低秩与高秩ViDA，我们进一步提出稳态知识分配策略，自适应整合各ViDA的不同知识。在四个广泛使用的基准上开展的广泛实验表明，本方法在分类与分割CTTA任务中均达到最佳性能。值得注意的是，本方法可作为大型模型的新型迁移范式，在适应持续变化分布方面展现出优异效果。项目页面：https://sites.google.com/view/iclr2024-vida/home。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

14+阅读 · 2022年3月12日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日