Coping with distributional shifts is an important part of transfer learning methods in order to perform well in real-life tasks. However, most of the existing approaches in this area either focus on an ideal scenario in which the data does not contain noises or employ a complicated training paradigm or model design to deal with distributional shifts. In this paper, we revisit the robustness of the minimum error entropy (MEE) criterion, a widely used objective in statistical signal processing to deal with non-Gaussian noises, and investigate its feasibility and usefulness in real-life transfer learning regression tasks, where distributional shifts are common. Specifically, we put forward a new theoretical result showing the robustness of MEE against covariate shift. We also show that by simply replacing the mean squared error (MSE) loss with the MEE on basic transfer learning algorithms such as fine-tuning and linear probing, we can achieve competitive performance with respect to state-of-the-art transfer learning algorithms. We justify our arguments on both synthetic data and 5 real-world time-series data.
翻译:应对分布偏移是迁移学习方法在实际任务中取得良好表现的重要环节。然而,该领域现有方法大多聚焦于数据不包含噪声的理想场景,或采用复杂的训练范式与模型设计来处理分布偏移。本文重新审视了最小误差熵(MEE)准则的鲁棒性——该准则是统计信号处理中处理非高斯噪声的常用目标函数,并探究其在普遍存在分布偏移的实际迁移学习回归任务中的可行性与实用性。具体而言,我们提出了一项新的理论结果,证明MEE对协变量偏移具有鲁棒性。同时表明,只需将微调和线性探测等基础迁移学习算法中的均方误差(MSE)损失替换为MEE,即可达到与当前最先进迁移学习算法相媲美的性能。我们在合成数据及5个真实时间序列数据上验证了上述论点。