A soft nearest-neighbor framework for continual semi-supervised learning

Despite significant advances, the performance of state-of-the-art continual learning approaches hinges on the unrealistic assumption of fully labeled data. In this paper, we tackle this challenge and propose an approach for continual semi-supervised learning, a setting where not all the data samples are labeled. A primary issue in this scenario is that the model forgets representations of unlabeled data and overfits the labeled samples. We leverage the power of nearest-neighbor classifiers to nonlinearly partition the feature space and flexibly model the underlying data distribution, thanks to their non-parametric nature. This enables the model to learn a strong representation for the current task and to distill relevant information from previous tasks. We perform a thorough experimental evaluation and show that our method outperforms all existing approaches by large margins, setting a solid state of the art in the continual semi-supervised learning paradigm. For example, on CIFAR-100 our method surpasses several competing approaches while using at least 30 times less supervision (0.8% vs. 25% of annotations). Finally, our method works well on both low- and high-resolution images and scales seamlessly to more complex datasets such as ImageNet-100. The code is publicly available at https://github.com/kangzhiq/NNCSL
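
To make the idea of a soft nearest-neighbor classifier concrete, the following is a minimal sketch and not the authors' implementation (see the repository above for that). It assumes cosine similarity between feature vectors, a softmax over those similarities with a temperature, and a vote weighted by the labels of a small labeled support set. The function name `soft_nearest_neighbor`, the `temperature` value, and the toy 2-D features are all hypothetical choices made for illustration.

```python
import math


def soft_nearest_neighbor(query, support, labels, num_classes, temperature=0.1):
    """Return class probabilities for `query` as a similarity-weighted
    vote over labeled support samples (illustrative sketch only)."""

    def normalize(vec):
        # Scale to unit length so the dot product equals cosine similarity.
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        return [x / norm for x in vec]

    q = normalize(query)
    # Temperature-scaled cosine similarity to every labeled support sample.
    sims = [
        sum(a * b for a, b in zip(q, normalize(s))) / temperature
        for s in support
    ]
    # Numerically stable softmax over the similarities.
    peak = max(sims)
    weights = [math.exp(s - peak) for s in sims]
    total = sum(weights)
    # Each support sample votes for its own class with its softmax weight.
    probs = [0.0] * num_classes
    for weight, label in zip(weights, labels):
        probs[label] += weight / total
    return probs


# Toy labeled support set: two samples per class in a 2-D feature space.
support = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = [0, 0, 1, 1]
probs = soft_nearest_neighbor([0.8, 0.2], support, labels, num_classes=2)
print(probs)
```

Because the prediction depends only on the stored labeled features rather than on a learned linear layer, the decision boundary can be nonlinear and adapts as the support set changes, which is the non-parametric property the abstract refers to. A lower temperature makes the vote closer to a hard nearest-neighbor decision.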

