Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics - 专知论文

会员服务 ·

0

有偏 · 可理解性 · Networking · Analysis · ReLU ·

2023 年 5 月 4 日

Understanding the Spectral Bias of Coordinate Based MLPs Via Training Dynamics

翻译：基于训练动力学的坐标基MLP谱偏置理解

John Lazzari,Xiuwen Liu

from arxiv, 8 pages, 10 figures, preprint

Spectral bias is an important observation of neural network training, stating that the network will learn a low frequency representation of the target function before converging to higher frequency components. This property is interesting due to its link to good generalization in over-parameterized networks. However, in low dimensional settings, a severe spectral bias occurs that obstructs convergence to high frequency components entirely. In order to overcome this limitation, one can encode the inputs using a high frequency sinusoidal encoding. Previous works attempted to explain this phenomenon using Neural Tangent Kernel (NTK) and Fourier analysis. However, NTK does not capture real network dynamics, and Fourier analysis only offers a global perspective on the network properties that induce this bias. In this paper, we provide a novel approach towards understanding spectral bias by directly studying ReLU MLP training dynamics. Specifically, we focus on the connection between the computations of ReLU networks (activation regions), and the speed of gradient descent convergence. We study these dynamics in relation to the spatial information of the signal to understand how they influence spectral bias. We then use this formulation to study the severity of spectral bias in low dimensional settings, and how positional encoding overcomes this.

翻译：谱偏置是神经网络训练中的一个重要发现，它表明网络在收敛到高频分量之前，会先学习目标函数的低频表示。这一性质因其与过参数化网络的良好泛化能力相关联而备受关注。然而，在低维场景中，严重的谱偏置会完全阻碍网络向高频分量的收敛。为克服这一局限，可采用高频正弦编码对输入进行编码。现有研究尝试利用神经正切核（NTK）与傅里叶分析解释该现象，但NTK无法捕捉真实网络动力学，而傅里叶分析仅能从全局视角揭示引发偏置的网络特性。本文提出一种全新方法，通过直接研究ReLU MLP的训练动力学来理解谱偏置。具体而言，我们聚焦于ReLU网络的计算过程（激活区域）与梯度下降收敛速度之间的关联，并基于信号空间信息分析这些动力学特性如何影响谱偏置。进而利用该理论框架，探究低维场景中谱偏置的严重程度以及位置编码对此问题的缓解机制。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于能量传递的宽带太阳光谱调制近红外下转换CaNb2O6发光薄膜研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于氧化锌微米线与银薄膜的表面等离子体Fabry-Perot微腔研究

国家自然科学基金

0+阅读 · 2013年12月31日

磷脂酶C-γ2（PLCG2）调节大鼠再生肝的肝细胞凋亡机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

亚微米尺度下Beta钛合金单晶力学行为及变形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

多孔介质中的Brinkman-Forchheimer方程解的稳定性研究

国家自然科学基金

0+阅读 · 2011年12月31日

磁性多层膜的可控离子液体电沉积及层间磁耦合

国家自然科学基金

0+阅读 · 2011年12月31日

基于OLED光源和离子交换波导的表面等离子体传感器研究

国家自然科学基金

0+阅读 · 2010年12月31日

LEAD: Min-Max Optimization from a Physical Perspective

Arxiv

0+阅读 · 2023年6月21日

Generalizing the de Finetti--Hewitt--Savage theorem

Arxiv

0+阅读 · 2023年6月20日

One-to-Many Spectral Upsampling of Reflectances and Transmittances

Arxiv

0+阅读 · 2023年6月20日

Learning Models of Adversarial Agent Behavior under Partial Observability

Arxiv

0+阅读 · 2023年6月19日

Weisfeiler--Leman and Graph Spectra

Arxiv

0+阅读 · 2023年6月19日

Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics

Arxiv

0+阅读 · 2023年6月16日

Graph Ordering Attention Networks

Arxiv

12+阅读 · 2022年11月21日

Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces

Arxiv

18+阅读 · 2022年11月7日

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Arxiv

15+阅读 · 2022年3月3日

Distributed Graph Convolutional Networks

Arxiv

19+阅读 · 2020年7月13日

VIP会员

文章信息

相关主题

最新内容

《通用大语言模型：无人机指挥与控制接口》最新40页

《通用大语言模型：无人机指挥与控制接口》最新40页

专知会员服务

0+阅读 · 22分钟前

《通过小型无人机系统将情报能力“作战化”》

《通过小型无人机系统将情报能力“作战化”》

专知会员服务

0+阅读 · 27分钟前

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

专知会员服务

0+阅读 · 41分钟前

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

专知会员服务

15+阅读 · 6月15日

消耗优势：美军的“精确规模化”概念

消耗优势：美军的“精确规模化”概念

专知会员服务

7+阅读 · 6月15日

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

五角大楼的AI优先战略及其对现代战争的启示：来自与伊朗冲突的经验教训

专知会员服务

8+阅读 · 6月15日

《网络空间兵棋推演：挑战、局限性与混合路径》报告

《网络空间兵棋推演：挑战、局限性与混合路径》报告

专知会员服务

8+阅读 · 6月15日

《离线语言支持系统：面向空战战术决策》

《离线语言支持系统：面向空战战术决策》

专知会员服务

8+阅读 · 6月15日

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

《以通信为中心的6G–LLM架构：面向可扩展的战术自主防御车辆网络》

专知会员服务

6+阅读 · 6月15日

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

ICML 2026｜ECA：面向开放式图文生成的高效持续对齐

专知会员服务

6+阅读 · 6月14日

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

可信智能体AI综述：安全、鲁棒性、隐私与系统安全

专知会员服务

6+阅读 · 6月14日

俄乌战场地面机器人如何改写战争规则

俄乌战场地面机器人如何改写战争规则

专知会员服务

9+阅读 · 6月14日

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

美国海军研究生院第23届年度采购研究研讨会与创新峰会：主题“加速作战能力”，附会议报告论文集1300页

专知会员服务

13+阅读 · 6月14日

《新空中力量概念：来自敏捷战斗运用的启示》2026最新50页报告

《新空中力量概念：来自敏捷战斗运用的启示》2026最新50页报告

专知会员服务

14+阅读 · 6月14日

《无人水面艇文献综述与结构设计》135页

《无人水面艇文献综述与结构设计》135页

专知会员服务

16+阅读 · 6月13日

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

84+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《通过小型无人机系统将情报能力“作战化”》

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

《通用大语言模型：无人机指挥与控制接口》最新40页

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

LEAD: Min-Max Optimization from a Physical Perspective

Arxiv

0+阅读 · 2023年6月21日

Generalizing the de Finetti--Hewitt--Savage theorem

Arxiv

0+阅读 · 2023年6月20日

One-to-Many Spectral Upsampling of Reflectances and Transmittances

Arxiv

0+阅读 · 2023年6月20日

Learning Models of Adversarial Agent Behavior under Partial Observability

Arxiv

0+阅读 · 2023年6月19日

Weisfeiler--Leman and Graph Spectra

Arxiv

0+阅读 · 2023年6月19日

Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics

Arxiv

0+阅读 · 2023年6月16日

Graph Ordering Attention Networks

Arxiv

12+阅读 · 2022年11月21日

Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces

Arxiv

18+阅读 · 2022年11月7日

NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields

Arxiv

15+阅读 · 2022年3月3日

Distributed Graph Convolutional Networks

Arxiv

19+阅读 · 2020年7月13日

相关基金

PP2Cδ调控的线粒体ROS通路在肺损伤和炎症中的作用机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于能量传递的宽带太阳光谱调制近红外下转换CaNb2O6发光薄膜研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于氧化锌微米线与银薄膜的表面等离子体Fabry-Perot微腔研究

国家自然科学基金

0+阅读 · 2013年12月31日

磷脂酶C-γ2（PLCG2）调节大鼠再生肝的肝细胞凋亡机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

Partial Spread Bent函数与Bent-Negabent函数的构造及密码学性质研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cofilin在Erucin诱导的乳腺癌细胞线粒体分裂和细胞凋亡中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

亚微米尺度下Beta钛合金单晶力学行为及变形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

多孔介质中的Brinkman-Forchheimer方程解的稳定性研究

国家自然科学基金

0+阅读 · 2011年12月31日

磁性多层膜的可控离子液体电沉积及层间磁耦合

国家自然科学基金

0+阅读 · 2011年12月31日

基于OLED光源和离子交换波导的表面等离子体传感器研究

国家自然科学基金

0+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员