We develop here a novel transfer learning methodology called Profiled Transfer Learning (PTL). The method is based on the \textit{approximate-linear} assumption between the source and target parameters. Compared with the commonly assumed \textit{vanishing-difference} assumption and \textit{low-rank} assumption in the literature, the \textit{approximate-linear} assumption is more flexible and less stringent. Specifically, the PTL estimator is constructed by two major steps. Firstly, we regress the response on the transferred feature, leading to the profiled responses. Subsequently, we learn the regression relationship between profiled responses and the covariates on the target data. The final estimator is then assembled based on the \textit{approximate-linear} relationship. To theoretically support the PTL estimator, we derive the non-asymptotic upper bound and minimax lower bound. We find that the PTL estimator is minimax optimal under appropriate regularity conditions. Extensive simulation studies are presented to demonstrate the finite sample performance of the new method. A real data example about sentence prediction is also presented with very encouraging results.
翻译:本文提出了一种新颖的迁移学习方法,称为特征轮廓迁移学习。该方法基于源域与目标域参数间的近似线性假设。与文献中常见的差异消失假设和低秩假设相比,近似线性假设更具灵活性且约束条件更宽松。具体而言,PTL估计量的构建包含两个主要步骤:首先,将响应变量对迁移特征进行回归,得到轮廓化响应变量;随后,在目标数据上学习轮廓化响应变量与协变量之间的回归关系。最终估计量基于近似线性关系进行整合。为从理论上支持PTL估计量,我们推导了非渐近上界和极小极大下界,发现在适当的正则条件下该估计量具有极小极大最优性。通过大量模拟研究验证了新方法在有限样本下的性能表现,并以句子预测的实际数据案例展示了该方法令人鼓舞的应用效果。