Transfer learning with affine model transformation

Supervised transfer learning has received considerable attention due to its potential to boost the predictive power of machine learning in scenarios where data are scarce. Generally, a given set of source models and a dataset from a target domain are used to adapt the pre-trained models to a target domain by statistically learning domain shift and domain-specific factors. While such procedurally and intuitively plausible methods have achieved great success in a wide range of real-world applications, the lack of a theoretical basis hinders further methodological development. This paper presents a general class of transfer learning regression called affine model transfer, following the principle of expected-square loss minimization. It is shown that the affine model transfer broadly encompasses various existing methods, including the most common procedure based on neural feature extractors. Furthermore, the current paper clarifies theoretical properties of the affine model transfer such as generalization error and excess risk. Through several case studies, we demonstrate the practical benefits of modeling and estimating inter-domain commonality and domain-specific factors separately with the affine-type transfer models.

翻译：监督式迁移学习因其在数据稀缺场景下提升机器学习预测能力的潜力而受到广泛关注。通常，通过统计学习领域偏移和领域特定因素，利用一组给定的源模型和目标域数据集，将预训练模型适配到目标域。虽然这类在程序和直觉上合理的方法已在众多实际应用中取得了巨大成功，但理论基础的缺乏阻碍了方法的进一步发展。本文提出一类通用的迁移学习回归方法，称为仿射模型迁移，遵循期望平方损失最小化原则。研究表明，仿射模型迁移广泛涵盖了包括基于神经特征提取器的通用方法在内的多种现有技术。此外，本文阐明了仿射模型迁移的理论性质，如泛化误差和过量风险。通过多个案例研究，我们展示了使用仿射型迁移模型分别建模和估计跨领域共性与领域特定因素的实际优势。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【ACL2021】认知启发的时序知识图谱两阶段推理模型

专知会员服务

46+阅读 · 2021年8月6日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日