Deep neural network can easily overfit to even noisy labels due to its high capacity, which degrades the generalization performance of a model. To overcome this issue, we propose a new approach for learning from noisy labels (LNL) via post-training, which can significantly improve the generalization performance of any pre-trained model on noisy label data. To this end, we rather exploit the overfitting property of a trained model to identify mislabeled samples. Specifically, our post-training approach gradually removes samples with high influence on the decision boundary and refines the decision boundary to improve generalization performance. Our post-training approach creates great synergies when combined with the existing LNL methods. Experimental results on various real-world and synthetic benchmark datasets demonstrate the validity of our approach in diverse realistic scenarios.
翻译:深度神经网络因其高容量特性容易过拟合甚至包含噪声的标签,从而降低模型的泛化性能。为解决这一问题,我们提出了一种基于后训练(post-training)的噪声标签学习新方法,该方法能够显著提升任意预训练模型在噪声标签数据上的泛化性能。为此,我们反而利用已训练模型的过拟合特性来识别错误标注样本。具体而言,我们的后训练方法逐步移除对决策边界影响较大的样本,并优化决策边界以提升泛化性能。该方法与现有噪声标签学习方法结合时能产生显著的协同效应。在各类真实与合成基准数据集上的实验结果表明,我们的方法在多种实际场景中均具有有效性。