Regularization-Based Efficient Continual Learning in Deep State-Space Models

Deep state-space models (DSSMs) have gained popularity in recent years due to their potent modeling capacity for dynamic systems. However, existing DSSM works are limited to single-task modeling, which requires retraining with historical task data upon revisiting a forepassed task. To address this limitation, we propose continual learning DSSMs (CLDSSMs), which are capable of adapting to evolving tasks without catastrophic forgetting. Our proposed CLDSSMs integrate mainstream regularization-based continual learning (CL) methods, ensuring efficient updates with constant computational and memory costs for modeling multiple dynamic systems. We also conduct a comprehensive cost analysis of each CL method applied to the respective CLDSSMs, and demonstrate the efficacy of CLDSSMs through experiments on real-world datasets. The results corroborate that while various competing CL methods exhibit different merits, the proposed CLDSSMs consistently outperform traditional DSSMs in terms of effectively addressing catastrophic forgetting, enabling swift and accurate parameter transfer to new tasks.

翻译：深度状态空间模型（DSSMs）近年来因对动态系统强大的建模能力而受到广泛关注。然而，现有DSSM研究局限于单任务建模，当重新处理先前任务时需要利用历史任务数据重新训练模型。为解决这一局限，我们提出持续学习DSSMs（CLDSSMs），该模型无需灾难性遗忘即可适应不断演变的任务。所提出的CLDSSMs集成了主流基于正则化的持续学习（CL）方法，通过恒定的计算与存储成本确保对多个动态系统的高效更新。我们针对各CL方法在对应CLDSSMs中的应用进行了全面的成本分析，并通过真实世界数据集实验验证CLDSSMs的有效性。结果表明，尽管不同竞争性CL方法各具优势，但所提出的CLDSSMs在有效解决灾难性遗忘、实现快速准确的参数迁移至新任务方面，始终优于传统DSSMs。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日