The pursuit of long-term autonomy requires that machine learning models continuously adapt to their changing environments and learn to solve new tasks. Continual learning seeks to overcome the challenge of catastrophic forgetting, where learning to solve new tasks causes a model to forget previously learnt information. Prior-based continual learning methods are appealing as they are computationally efficient and do not require auxiliary models or data storage. However, prior-based approaches typically fail on important benchmarks and are thus limited in their potential applications compared to their memory-based counterparts. We introduce Bayesian adaptive moment regularization (BAdam), a novel prior-based method that better constrains parameter growth, reducing catastrophic forgetting. Our method has a range of desirable properties: it is lightweight and task label-free, converges quickly, and offers calibrated uncertainty, which is important for safe real-world deployment. Results show that BAdam achieves state-of-the-art performance for prior-based methods on challenging single-headed class-incremental experiments such as Split MNIST and Split FashionMNIST, and does so without relying on task labels or discrete task boundaries.