Task-free online continual learning (TF-CL) is a challenging problem where the model incrementally learns tasks without explicit task information. Although training with entire data from the past, present as well as future is considered as the gold standard, naive approaches in TF-CL with the current samples may be conflicted with learning with samples in the future, leading to catastrophic forgetting and poor plasticity. Thus, a proactive consideration of an unseen future sample in TF-CL becomes imperative. Motivated by this intuition, we propose a novel TF-CL framework considering future samples and show that injecting adversarial perturbations on both input data and decision-making is effective. Then, we propose a novel method named Doubly Perturbed Continual Learning (DPCL) to efficiently implement these input and decision-making perturbations. Specifically, for input perturbation, we propose an approximate perturbation method that injects noise into the input data as well as the feature vector and then interpolates the two perturbed samples. For decision-making process perturbation, we devise multiple stochastic classifiers. We also investigate a memory management scheme and learning rate scheduling reflecting our proposed double perturbations. We demonstrate that our proposed method outperforms the state-of-the-art baseline methods by large margins on various TF-CL benchmarks.
翻译:无任务在线持续学习(TF-CL)是一个具有挑战性的问题,要求模型在没有明确任务信息的情况下逐步学习任务。尽管使用过去、现在乃至未来的全部数据进行训练被视为黄金标准,但TF-CL中基于当前样本的朴素方法可能与未来样本的学习产生冲突,导致灾难性遗忘和可塑性不足。因此,在TF-CL中主动考虑未见过的未来样本变得至关重要。受此直觉启发,我们提出了一种考虑未来样本的新型TF-CL框架,并证明在输入数据和决策过程中注入对抗性扰动是有效的。进而,我们提出了一种名为双重扰动持续学习(DPCL)的新方法,以高效实现这些输入和决策扰动。具体而言,针对输入扰动,我们提出一种近似扰动方法,向输入数据及特征向量注入噪声,并对两个扰动样本进行插值。针对决策过程扰动,我们设计了多个随机分类器。我们还研究了反映所提双重扰动的内存管理方案和学习率调度策略。实验表明,我们的方法在多个TF-CL基准测试中大幅超越了当前最先进的基线方法。