Few-shot class-incremental learning (FSCIL) requires a model to integrate new classes from only a few training samples while preserving its knowledge of previously learned classes. Traditional methods mostly rely on static adaptation: they learn from sequentially arriving data within a fixed parameter space, which makes them prone to overfitting to the current session. Existing dynamic strategies instead expand the parameter space continually, which increases model complexity. To address these challenges, we integrate the recently proposed selective state space model (SSM) into FSCIL. Concretely, we propose a dual selective SSM projector that dynamically adjusts its projection parameters based on intermediate features. The dual design enables the model to maintain robust features for base classes while adaptively learning distinctive feature shifts for novel classes. Additionally, we develop a class-sensitive selective scan mechanism to guide this dynamic adaptation. It minimizes the disruption to base-class representations caused by training on novel data and, at the same time, forces the selective scan to follow distinct patterns for base and novel classes. Experiments on miniImageNet, CUB-200, and CIFAR-100 demonstrate that our framework outperforms existing state-of-the-art methods. The code is available at https://github.com/xiaojieli0903/Mamba-FSCIL.
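To make the idea of input-dependent projection parameters concrete, the following is a minimal sketch of a single-channel selective scan combined with a fixed base branch. It is an illustration under stated assumptions, not the authors' implementation: the function names (`selective_scan`, `dual_project`), the diagonal state matrix, the mean-pooled scalar input, and the way the two branches are summed are all hypothetical choices made for this example.

```python
import math


def softplus(x):
    # Numerically stable softplus; keeps the step size positive.
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)


def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def selective_scan(xs, a, w_delta, w_b, w_c):
    """Toy single-channel selective SSM over a sequence of feature vectors.

    xs      : list of L feature vectors, each of length D
    a       : list of N negative decay rates (assumed diagonal state matrix)
    w_delta : length-D weights producing the input-dependent step size
    w_b, w_c: N x D weights producing the input-dependent B_t and C_t
    """
    n = len(a)
    h = [0.0] * n  # hidden state
    ys = []
    for x in xs:
        u = sum(x) / len(x)                # scalar input (assumption of this toy)
        delta = softplus(dot(w_delta, x))  # step size depends on the input
        b = [dot(row, x) for row in w_b]   # input-dependent B_t
        c = [dot(row, x) for row in w_c]   # input-dependent C_t
        # Discretised recurrence: h_t = exp(delta * a) * h_{t-1} + delta * B_t * u_t
        h = [math.exp(delta * a[i]) * h[i] + delta * b[i] * u for i in range(n)]
        ys.append(dot(c, h))               # y_t = C_t . h_t
    return ys


def dual_project(xs, base_w, ssm_params):
    """Hypothetical dual design: fixed base branch plus selective residual."""
    # Base branch: a fixed linear projection of the mean feature.
    mean = [sum(col) / len(xs) for col in zip(*xs)]
    base = dot(base_w, mean)
    # Selective branch: input-dependent output, averaged over the sequence.
    ys = selective_scan(xs, *ssm_params)
    return base + sum(ys) / len(ys)
```

Because the step size and the B and C terms are recomputed from each input, the selective branch can respond differently to different inputs without adding parameters, while the base branch stays unchanged. The class-sensitive scan mechanism described in the abstract is not modelled here.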