BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation

Continual Test Time Adaptation (CTTA) is required to adapt efficiently to continuous unseen domains while retaining previously learned knowledge. However, despite the progress of CTTA, forgetting-adaptation trade-offs and efficiency are still unexplored. Moreover, current CTTA scenarios assume only the disjoint situation, even though real-world domains are seamlessly changed. To tackle these challenges, this paper proposes BECoTTA, an input-dependent yet efficient framework for CTTA. We propose Mixture-of-Domain Low-rank Experts (MoDE) that contains two core components: (i) Domain-Adaptive Routing, which aids in selectively capturing the domain-adaptive knowledge with multiple domain routers, and (ii) Domain-Expert Synergy Loss to maximize the dependency between each domain and expert. We validate our method outperforms multiple CTTA scenarios including disjoint and gradual domain shits, while only requiring ~98% fewer trainable parameters. We also provide analyses of our method, including the construction of experts, the effect of domain-adaptive experts, and visualizations.

翻译：持续测试时自适应（CTTA）要求模型在高效适应持续出现的未知域的同时，保留先前学习到的知识。然而，尽管CTTA已取得进展，但遗忘-适应权衡与效率问题仍未得到充分探索。此外，当前CTTA场景仅假设域不连续变化的情形，而现实世界的域往往是平滑渐变的。为解决这些挑战，本文提出BECoTTA——一种输入依赖且高效的CTTA框架。我们提出了混合域低秩专家（MoDE）模块，其包含两个核心组件：（i）域自适应路由，通过多个域路由器选择性捕获域自适应知识；（ii）域-专家协同损失，最大化每个域与对应专家之间的依赖性。实验表明，我们的方法在包括不连续与渐变域偏移在内的多种CTTA场景中均取得更优性能，且可训练参数减少约98%。我们还提供了方法分析，包括专家构建机制、域自适应专家效果及可视化结果。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日