Due to the limited availability of data, existing few-shot learning methods trained from scratch fail to achieve satisfactory performance. In contrast, large-scale pre-trained models such as CLIP demonstrate remarkable few-shot and zero-shot capabilities. To enhance the performance of pre-trained models for downstream tasks, fine-tuning the model on downstream data is frequently necessary. However, fine-tuning the pre-trained model leads to a decrease in its generalizability in the presence of distribution shift, while the limited number of samples in few-shot learning makes the model highly susceptible to overfitting. Consequently, existing methods for fine-tuning few-shot learning primarily focus on fine-tuning the model's classification head or introducing additional structure. In this paper, we introduce a fine-tuning approach termed Feature Discrimination Alignment (FD-Align). Our method aims to bolster the model's generalizability by preserving the consistency of spurious features across the fine-tuning process. Extensive experimental results validate the efficacy of our approach for both ID and OOD tasks. Once fine-tuned, the model can seamlessly integrate with existing methods, leading to performance improvements. Our code can be found in https://github.com/skingorz/FD-Align.
翻译:由于数据可用性有限,现有从头训练的小样本学习方法难以取得令人满意的性能。相比之下,CLIP等大规模预训练模型展现出卓越的小样本和零样本能力。为提升预训练模型在下游任务中的性能,通常需要在下游数据上对模型进行微调。然而,微调预训练模型会因分布偏移导致其泛化能力下降,而小样本学习中样本数量有限又使模型极易过拟合。因此,现有小样本学习的微调方法主要聚焦于微调模型的分类头或引入额外结构。本文提出一种名为特征判别对齐(FD-Align)的微调方法。该方法旨在通过保持微调过程中虚假特征的一致性来增强模型的泛化能力。大量实验结果验证了该方法在ID和OOD任务上的有效性。经微调后,该模型可无缝集成现有方法并提升性能。我们的代码可在https://github.com/skingorz/FD-Align获取。