Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression

Precision/Accuracy · Scaling · Kernelization · Learning · Kernel Regression

June 3, 2023

Lechao Xiao, Hong Hu, Theodor Misiakiewicz, Yue M. Lu, Jeffrey Pennington

From arXiv; 32 pages, 4 + 3 figures

As modern machine learning models continue to advance the computational frontier, it has become increasingly important to develop precise estimates for expected performance improvements under different model and data scaling regimes. Currently, theoretical understanding of the learning curves that characterize how the prediction error depends on the number of samples is restricted to either large-sample asymptotics ($m\to\infty$) or, for certain simple data distributions, to the high-dimensional asymptotics in which the number of samples scales linearly with the dimension ($m\propto d$). There is a wide gulf between these two regimes, including all higher-order scaling relations $m\propto d^r$, which are the subject of the present paper. We focus on the problem of kernel ridge regression for dot-product kernels and present precise formulas for the test error, bias, and variance, for data drawn uniformly from the sphere in the $r$th-order asymptotic scaling regime $m\to\infty$ with $m/d^r$ held constant. We observe a peak in the learning curve whenever $m \approx d^r/r!$ for any integer $r$, leading to multiple sample-wise descent and nontrivial behavior at multiple scales.
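To make the peak condition $m \approx d^r/r!$ concrete, the following is a minimal sketch that only evaluates that expression for a few integer orders $r$. It does not implement the paper's formulas for test error, bias, or variance. The function names and the choice $d = 50$ are illustrative assumptions, not taken from the paper.

```python
import math


def predicted_peak_sample_sizes(d, max_order):
    """Evaluate d**r / r! for r = 1..max_order.

    These are the sample sizes m at which the abstract reports a peak
    in the learning curve (m ~ d^r / r!). Hypothetical helper: it only
    evaluates the expression and says nothing about the peak height.
    """
    if d < 2 or max_order < 1:
        raise ValueError("need d >= 2 and max_order >= 1")
    return {r: d**r / math.factorial(r) for r in range(1, max_order + 1)}


def scaling_exponent(m, d):
    """Return log(m) / log(d), i.e. the exponent r such that m = d**r.

    Useful for locating a given sample size relative to the m ∝ d^r regimes.
    """
    if m < 1 or d < 2:
        raise ValueError("need m >= 1 and d >= 2")
    return math.log(m) / math.log(d)


# Illustrative values (assumed): input dimension d = 50, orders r = 1, 2, 3.
peaks = predicted_peak_sample_sizes(d=50, max_order=3)
for r, m in peaks.items():
    print(f"r = {r}: peak expected near m = {m:.1f} "
          f"(log m / log d = {scaling_exponent(m, 50):.2f})")
```

Because of the $1/r!$ factor, the peak for order $r$ sits below $m = d^r$ itself, so its exponent $\log m / \log d$ is somewhat smaller than $r$ at finite $d$.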

