面向理解参数迁移中的特征学习 (Towards Understanding Feature Learning in Parameter Transfer) - 专知论文

会员服务 ·

0

分析 · 知识 · 特征学习 · 有效性 · ReLU ·

Towards Understanding Feature Learning in Parameter Transfer

翻译：面向理解参数迁移中的特征学习

Hua Yuan,Xuran Meng,Qiufeng Wang,Shiyu Xia,Ning Xu,Xu Yang,Jing Wang,Xin Geng,Yong Rui

Parameter transfer is a central paradigm in transfer learning, enabling knowledge reuse across tasks and domains by sharing model parameters between upstream and downstream models. However, when only a subset of parameters from the upstream model is transferred to the downstream model, there remains a lack of theoretical understanding of the conditions under which such partial parameter reuse is beneficial and of the factors that govern its effectiveness. To address this gap, we analyze a setting in which both the upstream and downstream models are ReLU convolutional neural networks (CNNs). Within this theoretical framework, we characterize how the inherited parameters act as carriers of universal knowledge and identify key factors that amplify their beneficial impact on the target task. Furthermore, our analysis provides insight into why, in certain cases, transferring parameters can lead to lower test accuracy on the target task than training a new model from scratch. To our best knowledge, our theory is the first to provide a dynamic analysis for parameter transfer and also the first to prove the existence of negative transfer theoretically. Numerical experiments and real-world data experiments are conducted to empirically validate our theoretical findings.

翻译：参数迁移是迁移学习的核心范式，通过在上游模型与下游模型间共享模型参数，实现跨任务和跨领域的知识复用。然而，当仅将上游模型的部分参数迁移至下游模型时，对于此类部分参数复用何时有益以及影响其有效性的关键因素，目前仍缺乏理论上的理解。为填补这一空白，我们分析了一种场景：上游模型与下游模型均为ReLU卷积神经网络（CNN）。在此理论框架内，我们刻画了继承参数如何作为通用知识的载体，并识别了增强其对目标任务有益影响的关键因素。此外，我们的分析揭示了为何在某些情况下，迁移参数可能导致目标任务上的测试精度低于从头训练新模型。据我们所知，我们的理论首次为参数迁移提供了动态分析，同时也是首个在理论上证明负迁移存在的理论。我们通过数值实验与真实数据实验对所提理论发现进行了实证验证。

0

相关内容

《可信迁移学习：综述》

《可信迁移学习：综述》

专知会员服务

28+阅读 · 2024年12月20日

【牛津大学博士论文】序列决策中的迁移学习

【牛津大学博士论文】序列决策中的迁移学习

专知会员服务

24+阅读 · 2024年11月10日

面向工业监控典型监督任务的深度迁移学习方法：现状、挑战与展望

面向工业监控典型监督任务的深度迁移学习方法：现状、挑战与展望

专知会员服务

38+阅读 · 2023年1月8日

【AAAI2023】图上的非独立同分布迁移学习

【AAAI2023】图上的非独立同分布迁移学习

专知会员服务

24+阅读 · 2022年12月25日

【CMU博士论文】缓解负迁移提高迁移学习的泛化和效率，201页pdf

【CMU博士论文】缓解负迁移提高迁移学习的泛化和效率，201页pdf

专知会员服务

57+阅读 · 2022年4月19日

贝叶斯迁移学习: 迁移学习的概率图模型概述

贝叶斯迁移学习: 迁移学习的概率图模型概述

专知会员服务

70+阅读 · 2021年10月17日

最新《深度强化学习中的迁移学习》综述论文

最新《深度强化学习中的迁移学习》综述论文

专知会员服务

157+阅读 · 2020年9月20日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【论文|迁移自适应学习综述】Transfer Adaptation Learning: A Decade Survey

【论文|迁移自适应学习综述】Transfer Adaptation Learning: A Decade Survey

专知会员服务

45+阅读 · 2019年11月26日

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

专知会员服务

99+阅读 · 2019年11月11日

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

专知

48+阅读 · 2019年11月12日

八千字长文深度解读，迁移学习在强化学习中的应用及最新进展

八千字长文深度解读，迁移学习在强化学习中的应用及最新进展

机器之心

13+阅读 · 2019年10月17日

迁移自适应学习最新综述，附21页论文下载

迁移自适应学习最新综述，附21页论文下载

专知

34+阅读 · 2019年3月13日

一文了解迁移学习经典算法

一文了解迁移学习经典算法

AI100

11+阅读 · 2018年8月4日

【迁移学习】简述迁移学习在深度学习中的应用

【迁移学习】简述迁移学习在深度学习中的应用

产业智能官

15+阅读 · 2018年1月9日

【迁移学习】迁移学习的干货学习资料 | 干货分享 | 技术解读

【迁移学习】迁移学习的干货学习资料 | 干货分享 | 技术解读

产业智能官

15+阅读 · 2018年1月2日

迁移学习在深度学习中的应用

迁移学习在深度学习中的应用

专知

24+阅读 · 2017年12月24日

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

AI100

16+阅读 · 2017年12月23日

深度 | 迁移学习全面概述：从基本概念到相关研究

深度 | 迁移学习全面概述：从基本概念到相关研究

七月在线实验室

15+阅读 · 2017年8月15日

独家 | 一文读懂迁移学习（附学习工具包）

独家 | 一文读懂迁移学习（附学习工具包）

数据派THU

13+阅读 · 2017年7月13日

面向推荐系统中异构隐式反馈建模的迁移学习技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于迁移学习的图像隐写分析新方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于生态演替的文本大数据特征学习研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于记忆的不变图像特征学习方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于相依数据的梯度学习理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向大数据的安全迁移学习方法

国家自然科学基金

31+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

面向地理模型集成与运行的数据适配方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于图像特征的接收函数各向异性反演研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向基因组相关性研究的迁移学习理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

Transfer Learning Through Conditional Quantile Matching

Arxiv

0+阅读 · 2月2日

Transfer learning for scalar-on-function regression via control variates

Arxiv

0+阅读 · 1月23日

LATTLE: LLM Attention Transplant for Transfer Learning of Tabular Data Across Disparate Domains

Arxiv

0+阅读 · 1月23日

An Empirical Study on Ensemble-Based Transfer Learning Bayesian Optimisation with Mixed Variable Types

Arxiv

0+阅读 · 1月22日

Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression

Arxiv

0+阅读 · 1月16日

Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey

Arxiv

0+阅读 · 1月12日

Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation

Arxiv

0+阅读 · 1月7日

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

Arxiv

0+阅读 · 1月7日

Tessellation Localized Transfer learning for nonparametric regression

Arxiv

0+阅读 · 1月2日

Characterization of Transfer Using Multi-task Learning Curves

Arxiv

0+阅读 · 2025年12月31日

VIP会员

文章信息

相关主题

相关VIP内容

《可信迁移学习：综述》

《可信迁移学习：综述》

专知会员服务

28+阅读 · 2024年12月20日

【牛津大学博士论文】序列决策中的迁移学习

【牛津大学博士论文】序列决策中的迁移学习

专知会员服务

24+阅读 · 2024年11月10日

面向工业监控典型监督任务的深度迁移学习方法：现状、挑战与展望

面向工业监控典型监督任务的深度迁移学习方法：现状、挑战与展望

专知会员服务

38+阅读 · 2023年1月8日

【AAAI2023】图上的非独立同分布迁移学习

【AAAI2023】图上的非独立同分布迁移学习

专知会员服务

24+阅读 · 2022年12月25日

【CMU博士论文】缓解负迁移提高迁移学习的泛化和效率，201页pdf

【CMU博士论文】缓解负迁移提高迁移学习的泛化和效率，201页pdf

专知会员服务

57+阅读 · 2022年4月19日

贝叶斯迁移学习: 迁移学习的概率图模型概述

贝叶斯迁移学习: 迁移学习的概率图模型概述

专知会员服务

70+阅读 · 2021年10月17日

最新《深度强化学习中的迁移学习》综述论文

最新《深度强化学习中的迁移学习》综述论文

专知会员服务

157+阅读 · 2020年9月20日

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

【IBM】在视觉和关系推理中迁移学习，Transfer Learning in Visual and Relational Reasoning

专知会员服务

45+阅读 · 2020年1月15日

【论文|迁移自适应学习综述】Transfer Adaptation Learning: A Decade Survey

【论文|迁移自适应学习综述】Transfer Adaptation Learning: A Decade Survey

专知会员服务

45+阅读 · 2019年11月26日

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

【中科院计算所】迁移学习全面综述论文，A Comprehensive Survey on Transfer Learning，27页pdf，171篇参考文献

专知会员服务

99+阅读 · 2019年11月11日

热门VIP内容

开通专知VIP会员享更多权益服务

美国防部门开始扩建金穹反导系统基础设施

《基于选择性深度神经网络分类的弹性无线通信》最新报告

《多域作战中融合网络、电子战与动能机动》

《在东欧磨砺反无人机技能》美陆军最新反无人机训练报告

相关资讯

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

中科院发布最新迁移学习综述论文，带你全面了解40种迁移学习方法

专知

48+阅读 · 2019年11月12日

八千字长文深度解读，迁移学习在强化学习中的应用及最新进展

八千字长文深度解读，迁移学习在强化学习中的应用及最新进展

机器之心

13+阅读 · 2019年10月17日

迁移自适应学习最新综述，附21页论文下载

迁移自适应学习最新综述，附21页论文下载

专知

34+阅读 · 2019年3月13日

一文了解迁移学习经典算法

一文了解迁移学习经典算法

AI100

11+阅读 · 2018年8月4日

【迁移学习】简述迁移学习在深度学习中的应用

【迁移学习】简述迁移学习在深度学习中的应用

产业智能官

15+阅读 · 2018年1月9日

【迁移学习】迁移学习的干货学习资料 | 干货分享 | 技术解读

【迁移学习】迁移学习的干货学习资料 | 干货分享 | 技术解读

产业智能官

15+阅读 · 2018年1月2日

迁移学习在深度学习中的应用

迁移学习在深度学习中的应用

专知

24+阅读 · 2017年12月24日

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

什么是迁移学习？它都用在深度学习的哪些场景上？这篇文章替你讲清楚了

AI100

16+阅读 · 2017年12月23日

深度 | 迁移学习全面概述：从基本概念到相关研究

深度 | 迁移学习全面概述：从基本概念到相关研究

七月在线实验室

15+阅读 · 2017年8月15日

独家 | 一文读懂迁移学习（附学习工具包）

独家 | 一文读懂迁移学习（附学习工具包）

数据派THU

13+阅读 · 2017年7月13日

相关论文

Transfer Learning Through Conditional Quantile Matching

Arxiv

0+阅读 · 2月2日

Transfer learning for scalar-on-function regression via control variates

Arxiv

0+阅读 · 1月23日

LATTLE: LLM Attention Transplant for Transfer Learning of Tabular Data Across Disparate Domains

Arxiv

0+阅读 · 1月23日

An Empirical Study on Ensemble-Based Transfer Learning Bayesian Optimisation with Mixed Variable Types

Arxiv

0+阅读 · 1月22日

Transfer Learning for Benign Overfitting in High-Dimensional Linear Regression

Arxiv

0+阅读 · 1月16日

Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey

Arxiv

0+阅读 · 1月12日

Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation

Arxiv

0+阅读 · 1月7日

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

Arxiv

0+阅读 · 1月7日

Tessellation Localized Transfer learning for nonparametric regression

Arxiv

0+阅读 · 1月2日

Characterization of Transfer Using Multi-task Learning Curves

Arxiv

0+阅读 · 2025年12月31日

相关基金

面向推荐系统中异构隐式反馈建模的迁移学习技术研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于迁移学习的图像隐写分析新方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于生态演替的文本大数据特征学习研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于记忆的不变图像特征学习方法研究

国家自然科学基金

3+阅读 · 2015年12月31日

基于相依数据的梯度学习理论研究

国家自然科学基金

1+阅读 · 2015年12月31日

面向大数据的安全迁移学习方法

国家自然科学基金

31+阅读 · 2015年12月31日

面向异分布数据的主动学习方法

国家自然科学基金

12+阅读 · 2015年12月31日

面向地理模型集成与运行的数据适配方法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于图像特征的接收函数各向异性反演研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向基因组相关性研究的迁移学习理论与方法

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员