TriRE: A Multi-Mechanism Learning Paradigm for Continual Knowledge Retention and Promotion

Continual learning (CL) has remained a persistent challenge for deep neural networks due to catastrophic forgetting (CF) of previously learned tasks. Several techniques such as weight regularization, experience rehearsal, and parameter isolation have been proposed to alleviate CF. Despite their relative success, these research directions have predominantly remained orthogonal and suffer from several shortcomings, while missing out on the advantages of competing strategies. On the contrary, the brain continually learns, accommodates, and transfers knowledge across tasks by simultaneously leveraging several neurophysiological processes, including neurogenesis, active forgetting, neuromodulation, metaplasticity, experience rehearsal, and context-dependent gating, rarely resulting in CF. Inspired by how the brain exploits multiple mechanisms concurrently, we propose TriRE, a novel CL paradigm that encompasses retaining the most prominent neurons for each task, revising and solidifying the extracted knowledge of current and past tasks, and actively promoting less active neurons for subsequent tasks through rewinding and relearning. Across CL settings, TriRE significantly reduces task interference and surpasses different CL approaches considered in isolation.

翻译：持续学习（CL）因灾难性遗忘（CF）先前学习任务而始终是深度神经网络的持久挑战。研究者已提出权重正则化、经验重演、参数隔离等多种技术以缓解CF。尽管这些方法取得相对成功，但它们本质上仍保持正交性，存在若干缺陷，并错失了竞争策略的优势。相反，大脑通过同步利用多种神经生理过程——包括神经发生、主动遗忘、神经调节、元可塑性、经验重演和情境依赖性门控，持续学习、适应和迁移跨任务知识，极少出现CF。受大脑并发运用多重机制的启发，我们提出TriRE这一新型CL范式，该范式包含：为每个任务保留最突出神经元、修正并巩固当前与过去任务提取的知识，以及通过回退与重学习主动促进非活跃神经元适应后续任务。在多种CL设置中，TriRE显著减少任务干扰，并超越各类独立考虑的CL方法。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日