Divide and not forget: Ensemble of selectively trained experts in Continual Learning

Class-incremental learning is becoming more popular as it helps models widen their applicability while not forgetting what they already know. A trend in this area is to use a mixture-of-expert technique, where different models work together to solve the task. However, the experts are usually trained all at once using whole task data, which makes them all prone to forgetting and increasing computational burden. To address this limitation, we introduce a novel approach named SEED. SEED selects only one, the most optimal expert for a considered task, and uses data from this task to fine-tune only this expert. For this purpose, each expert represents each class with a Gaussian distribution, and the optimal expert is selected based on the similarity of those distributions. Consequently, SEED increases diversity and heterogeneity within the experts while maintaining the high stability of this ensemble method. The extensive experiments demonstrate that SEED achieves state-of-the-art performance in exemplar-free settings across various scenarios, showing the potential of expert diversification through data in continual learning.

翻译：类增量学习日益受到关注，因为它帮助模型在拓展应用范围的同时不遗忘已有知识。该领域的一个趋势是采用混合专家技术，即不同模型协同解决问题。然而，现有专家通常使用完整任务数据同时训练，导致它们都容易发生遗忘并增加计算负担。为解决这一局限，我们提出名为SEED的新方法。SEED仅为当前任务选择一个最优专家，并使用该任务数据仅微调该专家。为此，每个专家用高斯分布表示每个类别，通过分布相似性选择最优专家。SEED在保持集成方法高稳定性的同时，增强了专家间的多样性与异质性。大量实验表明，在多种无样本场景下，SEED均达到最佳性能，展现了通过数据实现专家多样化在持续学习中的潜力。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

语言视觉预训练语言模型揭密，Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models

专知会员服务

36+阅读 · 2020年5月20日

Query2box: 使用盒嵌入对向量空间中的知识图谱进行推理，Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings

专知会员服务

46+阅读 · 2020年5月11日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日