The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks - 专知论文

会员服务 ·

0

泛化 · 结构 · 结构化 · 神经网络 · 潜在 ·

The Rules-and-Facts Model for Simultaneous Generalization and Memorization in Neural Networks

翻译：神经网络中同步泛化与记忆的规则-事实模型

Gabriele Farné,Fabrizio Boncoraglio,Lenka Zdeborová

A key capability of modern neural networks is their capacity to simultaneously learn underlying rules and memorize specific facts or exceptions. Yet, theoretical understanding of this dual capability remains limited. We introduce the Rules-and-Facts (RAF) model, a minimal solvable setting that enables precise characterization of this phenomenon by bridging two classical lines of work in the statistical physics of learning: the teacher-student framework for generalization and Gardner-style capacity analysis for memorization. In the RAF model, a fraction $1 - \varepsilon$ of training labels is generated by a structured teacher rule, while a fraction $\varepsilon$ consists of unstructured facts with random labels. We characterize when the learner can simultaneously recover the underlying rule - allowing generalization to new data - and memorize the unstructured examples. Our results quantify how overparameterization enables the simultaneous realization of these two objectives: sufficient excess capacity supports memorization, while regularization and the choice of kernel or nonlinearity control the allocation of capacity between rule learning and memorization. The RAF model provides a theoretical foundation for understanding how modern neural networks can infer structure while storing rare or non-compressible information.

翻译：现代神经网络的一项关键能力是能够同时学习潜在规则并记忆特定事实或异常情况。然而，对这种双重能力的理论理解仍然有限。我们引入了规则-事实（RAF）模型，这是一个最小可解设定，通过桥接统计物理学习中两条经典研究路线——用于泛化的师生框架和用于记忆的Gardner式容量分析——能够精确刻画这一现象。在RAF模型中，训练标签的$1 - \varepsilon$部分由结构化教师规则生成，而$\varepsilon$部分则由带有随机标签的非结构化事实组成。我们刻画了学习者何时能够同时恢复潜在规则（从而泛化到新数据）并记忆非结构化样本。我们的结果量化了过参数化如何使这两个目标得以同步实现：充足的过剩容量支持记忆，而正则化以及核函数或非线性的选择则控制着容量在规则学习与记忆之间的分配。RAF模型为理解现代神经网络如何在存储罕见或不可压缩信息的同时推断结构提供了理论基础。

0

相关内容

【牛津大学博士论文】超参数化神经网络的泛化与表达性，221页pdf

【牛津大学博士论文】超参数化神经网络的泛化与表达性，221页pdf

专知会员服务

32+阅读 · 2024年4月19日

【UCLA博士论文】神经网络捕获的信息:与记忆和泛化的联系，143页pdf

【UCLA博士论文】神经网络捕获的信息:与记忆和泛化的联系，143页pdf

专知会员服务

41+阅读 · 2023年7月3日

【伯克利JD Co-Reyes博士论文】建立强化学习算法泛化:从潜在动力学模型到元学习，Building Reinforcement Learning Algorithms that Generalize: From Latent Dynamics Models to Meta-Learning

【伯克利JD Co-Reyes博士论文】建立强化学习算法泛化:从潜在动力学模型到元学习，Building Reinforcement Learning Algorithms that Generalize: From Latent Dynamics Models to Meta-Learning

专知会员服务

45+阅读 · 2022年3月6日

【香港中文大学&华为等】双曲图神经网络:方法与应用综述，Hyperbolic Graph Neural Networks: A Review of Methods and Applications

【香港中文大学&华为等】双曲图神经网络:方法与应用综述，Hyperbolic Graph Neural Networks: A Review of Methods and Applications

专知会员服务

21+阅读 · 2022年3月2日

【NeurIPS 2021 】学习理论(有时)可以解释图神经网络中的泛化

【NeurIPS 2021 】学习理论(有时)可以解释图神经网络中的泛化

专知会员服务

30+阅读 · 2021年12月13日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

深度神经网络模型的个体差异，Individual differences among deep neural network models

深度神经网络模型的个体差异，Individual differences among deep neural network models

专知会员服务

10+阅读 · 2020年1月11日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

基于模型的强化学习综述

基于模型的强化学习综述

专知

42+阅读 · 2022年7月13日

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

专知

36+阅读 · 2020年5月19日

深度神经网络可解释性方法汇总，附Tensorflow代码实现

深度神经网络可解释性方法汇总，附Tensorflow代码实现

新智元

34+阅读 · 2019年11月7日

神经网络常微分方程 (Neural ODEs) 解析

神经网络常微分方程 (Neural ODEs) 解析

AI科技评论

42+阅读 · 2019年8月9日

2019年新书推荐-《神经网络与深度学习》-Michael Nielsen

2019年新书推荐-《神经网络与深度学习》-Michael Nielsen

深度学习与NLP

14+阅读 · 2019年2月21日

这有一份花书《深度学习》笔记，深度学习规则，帮你抓住精髓！(附下载)

这有一份花书《深度学习》笔记，深度学习规则，帮你抓住精髓！(附下载)

专知

42+阅读 · 2019年1月7日

图神经网络最近这么火，不妨看看我们精选的这七篇

图神经网络最近这么火，不妨看看我们精选的这七篇

人工智能前沿讲习班

37+阅读 · 2018年12月10日

超全总结：神经网络加速之量化模型 | 附带代码

超全总结：神经网络加速之量化模型 | 附带代码

PaperWeekly

12+阅读 · 2018年6月1日

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

开放知识图谱

14+阅读 · 2018年4月3日

Coursera吴恩达《卷积神经网络》课程笔记（1）-- 卷积神经网络基础

Coursera吴恩达《卷积神经网络》课程笔记（1）-- 卷积神经网络基础

机器学习研究会

29+阅读 · 2018年1月29日

循环神经网络多模态深度模型联想记忆功能研究

国家自然科学基金

6+阅读 · 2017年12月31日

忆阻递归神经网络的多重稳定性理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向人类工作记忆改善的脑电复杂网络信息反馈非线性计算模型研究

国家自然科学基金

0+阅读 · 2015年12月31日

T-S模糊神经网络的容错同步性分析

国家自然科学基金

0+阅读 · 2015年12月31日

一对多联想记忆中的细胞神经网络建模及参数获取方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

多尺度模块网络下的储备池神经计算模型及算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

关联规则集上的知识发现

国家自然科学基金

9+阅读 · 2015年12月31日

学习与记忆的神经动力学研究

国家自然科学基金

1+阅读 · 2014年12月31日

反馈神经网络统一模型临界动力学研究及其在类脑计算机研制中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

非凸非光滑优化的神经网络设计及其关键问题研究

国家自然科学基金

0+阅读 · 2014年12月31日

Algorithm-hardware co-design of neuromorphic networks with dual memory pathways

Arxiv

0+阅读 · 5月2日

Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

Arxiv

0+阅读 · 4月30日

Multiple Additive Neural Networks for Structured and Unstructured Data

Arxiv

0+阅读 · 4月29日

Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks

Arxiv

0+阅读 · 4月27日

Relaxation-Informed Training of Neural Network Surrogate Models

Arxiv

0+阅读 · 4月24日

On the Theory of Continual Learning with Gradient Descent for Neural Networks

Arxiv

0+阅读 · 4月20日

HiPreNets: High-Precision Neural Networks through Progressive Training

Arxiv

0+阅读 · 4月17日

OmniFysics: Towards Physical Intelligence Evolution via Omni-Modal Signal Processing and Network Optimization

Arxiv

0+阅读 · 4月7日

A Model of Understanding in Deep Learning Systems

Arxiv

0+阅读 · 4月5日

Associative Memory using Attribute-Specific Neuron Groups-2: Learning and Sequential Associative Recall between Cue Neurons for different Cue Balls

Arxiv

0+阅读 · 3月26日

VIP会员

文章信息

相关主题

最新内容

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

专知会员服务

1+阅读 · 今天14:45

综述 | 世界动作模型：少做梦，多行动

综述 | 世界动作模型：少做梦，多行动

专知会员服务

1+阅读 · 今天14:43

美以伊冲突：无人机与人工智能的运用

美以伊冲突：无人机与人工智能的运用

专知会员服务

3+阅读 · 今天14:31

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

专知会员服务

3+阅读 · 今天14:20

《特种部队在透明战场中的生存力》最新报告

《特种部队在透明战场中的生存力》最新报告

专知会员服务

2+阅读 · 今天14:11

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

《自主无人机蜂群协同与控制系统：人工智能赋能的战场协同与自主任务编排平台》

专知会员服务

3+阅读 · 今天14:07

《人工智能生成的零日漏洞：对未来作战的影响》

《人工智能生成的零日漏洞：对未来作战的影响》

专知会员服务

3+阅读 · 今天14:03

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

《理解伙伴国在防务能力选择中的偏好：探索美国解决方案的替代选择》美智库200页报告

专知会员服务

2+阅读 · 今天13:59

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

5+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

8+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

7+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

4+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

5+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

5+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

8+阅读 · 6月22日

相关VIP内容

【牛津大学博士论文】超参数化神经网络的泛化与表达性，221页pdf

【牛津大学博士论文】超参数化神经网络的泛化与表达性，221页pdf

专知会员服务

32+阅读 · 2024年4月19日

【UCLA博士论文】神经网络捕获的信息:与记忆和泛化的联系，143页pdf

【UCLA博士论文】神经网络捕获的信息:与记忆和泛化的联系，143页pdf

专知会员服务

41+阅读 · 2023年7月3日

【伯克利JD Co-Reyes博士论文】建立强化学习算法泛化:从潜在动力学模型到元学习，Building Reinforcement Learning Algorithms that Generalize: From Latent Dynamics Models to Meta-Learning

【伯克利JD Co-Reyes博士论文】建立强化学习算法泛化:从潜在动力学模型到元学习，Building Reinforcement Learning Algorithms that Generalize: From Latent Dynamics Models to Meta-Learning

专知会员服务

45+阅读 · 2022年3月6日

【香港中文大学&华为等】双曲图神经网络:方法与应用综述，Hyperbolic Graph Neural Networks: A Review of Methods and Applications

【香港中文大学&华为等】双曲图神经网络:方法与应用综述，Hyperbolic Graph Neural Networks: A Review of Methods and Applications

专知会员服务

21+阅读 · 2022年3月2日

【NeurIPS 2021 】学习理论(有时)可以解释图神经网络中的泛化

【NeurIPS 2021 】学习理论(有时)可以解释图神经网络中的泛化

专知会员服务

30+阅读 · 2021年12月13日

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

【论文推荐】二值神经网络综述，Binary Neural Networks: A Survey

专知会员服务

53+阅读 · 2020年4月8日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

深度神经网络模型的个体差异，Individual differences among deep neural network models

深度神经网络模型的个体差异，Individual differences among deep neural network models

专知会员服务

10+阅读 · 2020年1月11日

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

【论文】用于推理的概率逻辑神经网络（Probabilistic Logic Neural Networks for Reasoning）

专知会员服务

104+阅读 · 2019年12月30日

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

【伯克利，基于模型的强化学习：理论与实践】《Model-Based Reinforcement Learning:Theory and Practice》，Michael Janner

专知会员服务

35+阅读 · 2019年12月12日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 世界动作模型：少做梦，多行动

《战时图神经网络：整合以色列-伊朗冲突中的网络安全与无人机智能》最新50页文献

ICML 2026 | CFPO：用反事实策略优化提升多模态推理

美以伊冲突：无人机与人工智能的运用

相关资讯

基于模型的强化学习综述

基于模型的强化学习综述

专知

42+阅读 · 2022年7月13日

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

最新《图卷积神经网络》中文综述论文，26页pdf，计算机学报-中科院计算所

专知

36+阅读 · 2020年5月19日

深度神经网络可解释性方法汇总，附Tensorflow代码实现

深度神经网络可解释性方法汇总，附Tensorflow代码实现

新智元

34+阅读 · 2019年11月7日

神经网络常微分方程 (Neural ODEs) 解析

神经网络常微分方程 (Neural ODEs) 解析

AI科技评论

42+阅读 · 2019年8月9日

2019年新书推荐-《神经网络与深度学习》-Michael Nielsen

2019年新书推荐-《神经网络与深度学习》-Michael Nielsen

深度学习与NLP

14+阅读 · 2019年2月21日

这有一份花书《深度学习》笔记，深度学习规则，帮你抓住精髓！(附下载)

这有一份花书《深度学习》笔记，深度学习规则，帮你抓住精髓！(附下载)

专知

42+阅读 · 2019年1月7日

图神经网络最近这么火，不妨看看我们精选的这七篇

图神经网络最近这么火，不妨看看我们精选的这七篇

人工智能前沿讲习班

37+阅读 · 2018年12月10日

超全总结：神经网络加速之量化模型 | 附带代码

超全总结：神经网络加速之量化模型 | 附带代码

PaperWeekly

12+阅读 · 2018年6月1日

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

论文浅尝 | 基于神经网络的推理（DeepMind Relational Reasoning）

开放知识图谱

14+阅读 · 2018年4月3日

Coursera吴恩达《卷积神经网络》课程笔记（1）-- 卷积神经网络基础

Coursera吴恩达《卷积神经网络》课程笔记（1）-- 卷积神经网络基础

机器学习研究会

29+阅读 · 2018年1月29日

相关论文

Algorithm-hardware co-design of neuromorphic networks with dual memory pathways

Arxiv

0+阅读 · 5月2日

Physical Foundation Models: Fixed hardware implementations of large-scale neural networks

Arxiv

0+阅读 · 4月30日

Multiple Additive Neural Networks for Structured and Unstructured Data

Arxiv

0+阅读 · 4月29日

Cortex-Inspired Continual Learning: Unsupervised Instantiation and Recovery of Functional Task Networks

Arxiv

0+阅读 · 4月27日

Relaxation-Informed Training of Neural Network Surrogate Models

Arxiv

0+阅读 · 4月24日

On the Theory of Continual Learning with Gradient Descent for Neural Networks

Arxiv

0+阅读 · 4月20日

HiPreNets: High-Precision Neural Networks through Progressive Training

Arxiv

0+阅读 · 4月17日

OmniFysics: Towards Physical Intelligence Evolution via Omni-Modal Signal Processing and Network Optimization

Arxiv

0+阅读 · 4月7日

A Model of Understanding in Deep Learning Systems

Arxiv

0+阅读 · 4月5日

Associative Memory using Attribute-Specific Neuron Groups-2: Learning and Sequential Associative Recall between Cue Neurons for different Cue Balls

Arxiv

0+阅读 · 3月26日

相关基金

循环神经网络多模态深度模型联想记忆功能研究

国家自然科学基金

6+阅读 · 2017年12月31日

忆阻递归神经网络的多重稳定性理论研究

国家自然科学基金

0+阅读 · 2015年12月31日

面向人类工作记忆改善的脑电复杂网络信息反馈非线性计算模型研究

国家自然科学基金

0+阅读 · 2015年12月31日

T-S模糊神经网络的容错同步性分析

国家自然科学基金

0+阅读 · 2015年12月31日

一对多联想记忆中的细胞神经网络建模及参数获取方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

多尺度模块网络下的储备池神经计算模型及算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

关联规则集上的知识发现

国家自然科学基金

9+阅读 · 2015年12月31日

学习与记忆的神经动力学研究

国家自然科学基金

1+阅读 · 2014年12月31日

反馈神经网络统一模型临界动力学研究及其在类脑计算机研制中的应用

国家自然科学基金

1+阅读 · 2014年12月31日

非凸非光滑优化的神经网络设计及其关键问题研究

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员