Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons - 专知论文

会员服务 ·

0

缩放 · MoDELS · Networking · 神经元 · 循环网络 ·

Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons

翻译：暂无翻译

Aaron Spieler,Georg Martius,Anna Levina

from arxiv, 25 pages, 21 figures, 3 tables, including derivations. Submitted for peer review

Cortical neurons are complex, multi-timescale processors wired into recurrent circuits, shaped by long evolutionary pressure under stringent biological constraints. Mainstream machine learning, by contrast, predominantly builds models from extremely simple units, a default inherited from early neural-network theory. We treat this as a normative architectural question. How should one split a fixed parameter budget $P$ between the number of units $N$, per-unit effective complexity $k_e$, and per-unit connectivity $k_c$? What controls the optimal allocation? This calls for a model in which per-unit complexity can be tuned independently of width and connectivity. Accordingly, we introduce the ELM Network, whose recurrent layer is built from Expressive Leaky Memory (ELM) neurons, chosen to mirror functional components of cortical neurons. The architecture allows for individually adjusting $N$, $k_e$, and $k_c$ and trains stably across orders of magnitude in scale. We evaluate the model on two qualitatively different sequence benchmarks: the neuromorphic SHD-Adding task and Enwik8 character-level language modeling. Performance improves monotonically along each of the three axes individually. Under a fixed budget, a clear non-trivial optimum emerges in their tradeoff, and larger budgets favor both more and more complex neurons. A closed-form information-theoretic model captures these tradeoffs and attributes the diminishing returns at two ends to: per-neuron signal-to-noise saturation and across-neuron redundancy. A hyperparameter sweep spanning three orders of magnitude in trainable parameters traces a near-Pareto-frontier scaling law consistent with the framework. This suggests that the simple-unit default in ML is not obviously optimal once this tradeoff surface is probed, and offers a normative lens on cortex's reliance on complex spatio-temporal integrators.

翻译：暂无翻译

0

相关内容

ICLR 2026 | CoT-Evo：面向科学推理的思维链进化蒸馏框架

ICLR 2026 | CoT-Evo：面向科学推理的思维链进化蒸馏框架

专知会员服务

13+阅读 · 2月6日

NeurIPS2023 | AI4Science: Newton–Cotes图神经网络——动态系统上的时间演化

NeurIPS2023 | AI4Science: Newton–Cotes图神经网络——动态系统上的时间演化

专知会员服务

37+阅读 · 2023年10月25日

【NeurIPS2022】序列(推荐)模型分布外泛化：因果视角与求解

【NeurIPS2022】序列(推荐)模型分布外泛化：因果视角与求解

专知会员服务

14+阅读 · 2022年12月11日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【NeurIPS 2019】多关系庞加莱图嵌入，Multi-relational Poincaré Graph Embeddings

【NeurIPS 2019】多关系庞加莱图嵌入，Multi-relational Poincaré Graph Embeddings

专知会员服务

49+阅读 · 2020年6月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

专知会员服务

44+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【论文】Awesome Relation Extraction Paper（关系抽取）（PART IV）

【论文】Awesome Relation Extraction Paper（关系抽取）（PART IV）

AINLP

15+阅读 · 2019年8月26日

Graph Neural Networks 综述

Graph Neural Networks 综述

计算机视觉life

30+阅读 · 2019年8月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Network Embedding 指南

Network Embedding 指南

专知

22+阅读 · 2018年8月13日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

人Muse细胞诱导分化为神经前体细胞及功能性神经元并修复脊髓损伤

国家自然科学基金

0+阅读 · 2015年12月31日

力感应细胞源性外泌体及其microRNA货物介导的细胞间通讯在骨重建稳态中调控机制及干预策略的研究

国家自然科学基金

0+阅读 · 2015年12月31日

随机环境下多个体系统集体行为分析、调控与优化

国家自然科学基金

0+阅读 · 2015年12月31日

奇异耦合网络的动力学分析与控制

国家自然科学基金

0+阅读 · 2015年12月31日

部分同源片段间基因置换的研究及其可视化网络服务平台的建立

国家自然科学基金

0+阅读 · 2015年12月31日

双层网络下的振子集体行为研究：以生物钟神经元网络为例

国家自然科学基金

0+阅读 · 2015年12月31日

基于SCs促进OPCs存活、增殖和迁移的机制探讨SCI治疗的新策略

国家自然科学基金

0+阅读 · 2014年12月31日

新型多环Thienoacene衍生物及其共轭聚合物合成、表征与光伏器件阴极界面修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

环氧树脂基交联网络微观结构调控及其热致形状记忆构效关系

国家自然科学基金

0+阅读 · 2014年12月31日

基于肽类分子的多组分共组装：理性设计、多级调控与生物应用

国家自然科学基金

2+阅读 · 2014年12月31日

Dynamics of learning to integrate in linear recurrent neural networks

Arxiv

0+阅读 · 6月8日

Semantic Forwarding and Codebook-Enhanced Model Division Multiple Access for Satellite-Terrestrial Networks

Arxiv

0+阅读 · 6月5日

Extreme-Scale Interconnection Networks

Arxiv

0+阅读 · 5月26日

The Construction of Near-optimal Universal Coding of Integers

Arxiv

0+阅读 · 5月14日

Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL

Arxiv

0+阅读 · 5月14日

Cellwise and Casewise Robust Multivariate Regression with Inference

Arxiv

0+阅读 · 5月8日

DistributedEstimator: Distributed Training of Quantum Neural Networks via Circuit Cutting

Arxiv

0+阅读 · 5月5日

Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks

Arxiv

0+阅读 · 5月5日

Similarity and Matching of Neural Network Representations

Arxiv

10+阅读 · 2021年10月27日

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Arxiv

36+阅读 · 2020年5月24日

VIP会员

文章信息

相关主题

最新内容

定向能反无人机系统最新发展动态

定向能反无人机系统最新发展动态

专知会员服务

0+阅读 · 31分钟前

从燃煤战舰到算法战争：水面指挥的永恒要求

从燃煤战舰到算法战争：水面指挥的永恒要求

专知会员服务

1+阅读 · 48分钟前

《短程弹道再入飞行器拦截时间中的一项异常现象》

《短程弹道再入飞行器拦截时间中的一项异常现象》

专知会员服务

1+阅读 · 51分钟前

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

专知会员服务

1+阅读 · 53分钟前

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

美智库《战术级指挥控制的迫切要求：构建弹性机动式指挥控制网络》报告

专知会员服务

1+阅读 · 今天13:13

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

《韩国国防政策与军备出口：韩国安全与国防政策如何塑造其国防工业与军备出口格局》最新100页报告

专知会员服务

0+阅读 · 今天13:10

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

ICML 2026 | VOTP：用视频基础模型与最优传输，让离线偏好强化学习只需少量反馈

专知会员服务

5+阅读 · 6月16日

多模态代码智能综述：从视觉输入到可执行代码系统

多模态代码智能综述：从视觉输入到可执行代码系统

专知会员服务

7+阅读 · 6月16日

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

美国马六甲“三重网”概念：安全网、威慑网与杀伤网

专知会员服务

5+阅读 · 6月16日

《面向导弹有效发射时机的监督机器学习方法：基于超视距空战仿真》

《面向导弹有效发射时机的监督机器学习方法：基于超视距空战仿真》

专知会员服务

5+阅读 · 6月16日

《通用大语言模型：无人机指挥与控制接口》最新40页

《通用大语言模型：无人机指挥与控制接口》最新40页

专知会员服务

15+阅读 · 6月16日

《通过小型无人机系统将情报能力“作战化”》

《通过小型无人机系统将情报能力“作战化”》

专知会员服务

6+阅读 · 6月16日

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

《神经安全型有人–无人协同：面向认知自适应作战能力的参考架构》

专知会员服务

10+阅读 · 6月16日

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

《在指挥链中通过多准则决策分析传达指挥官意图：空战实验》

专知会员服务

21+阅读 · 6月15日

消耗优势：美军的“精确规模化”概念

消耗优势：美军的“精确规模化”概念

专知会员服务

8+阅读 · 6月15日

相关VIP内容

ICLR 2026 | CoT-Evo：面向科学推理的思维链进化蒸馏框架

ICLR 2026 | CoT-Evo：面向科学推理的思维链进化蒸馏框架

专知会员服务

13+阅读 · 2月6日

NeurIPS2023 | AI4Science: Newton–Cotes图神经网络——动态系统上的时间演化

NeurIPS2023 | AI4Science: Newton–Cotes图神经网络——动态系统上的时间演化

专知会员服务

37+阅读 · 2023年10月25日

【NeurIPS2022】序列(推荐)模型分布外泛化：因果视角与求解

【NeurIPS2022】序列(推荐)模型分布外泛化：因果视角与求解

专知会员服务

14+阅读 · 2022年12月11日

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

【CVPR 2022】跨模态检索的协同双流视觉-语言前训练模型，COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval

专知会员服务

13+阅读 · 2022年3月12日

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

【CVPR 2022】基于元内存传输的跨域少镜头语义分割，Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer

专知会员服务

13+阅读 · 2022年3月12日

【NeurIPS 2019】多关系庞加莱图嵌入，Multi-relational Poincaré Graph Embeddings

【NeurIPS 2019】多关系庞加莱图嵌入，Multi-relational Poincaré Graph Embeddings

专知会员服务

49+阅读 · 2020年6月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

八篇NeurIPS 2019【图神经网络（GNN）】相关论文

专知会员服务

44+阅读 · 2020年1月10日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

从燃煤战舰到算法战争：水面指挥的永恒要求

《基于回归方法与任务上下文的对抗环境动态战术网络报文优先级排序》

定向能反无人机系统最新发展动态

《短程弹道再入飞行器拦截时间中的一项异常现象》

相关资讯

【论文】Awesome Relation Extraction Paper（关系抽取）（PART IV）

【论文】Awesome Relation Extraction Paper（关系抽取）（PART IV）

AINLP

15+阅读 · 2019年8月26日

Graph Neural Networks 综述

Graph Neural Networks 综述

计算机视觉life

30+阅读 · 2019年8月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

《pyramid Attention Network for Semantic Segmentation》

《pyramid Attention Network for Semantic Segmentation》

统计学习与视觉计算组

44+阅读 · 2018年8月30日

Network Embedding 指南

Network Embedding 指南

专知

22+阅读 · 2018年8月13日

Relation Networks for Object Detection 论文笔记

Relation Networks for Object Detection 论文笔记

统计学习与视觉计算组

16+阅读 · 2018年4月18日

论文浅尝 | Question Answering over Freebase

论文浅尝 | Question Answering over Freebase

开放知识图谱

19+阅读 · 2018年1月9日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

Dynamics of learning to integrate in linear recurrent neural networks

Arxiv

0+阅读 · 6月8日

Semantic Forwarding and Codebook-Enhanced Model Division Multiple Access for Satellite-Terrestrial Networks

Arxiv

0+阅读 · 6月5日

Extreme-Scale Interconnection Networks

Arxiv

0+阅读 · 5月26日

The Construction of Near-optimal Universal Coding of Integers

Arxiv

0+阅读 · 5月14日

Efficient Generative Retrieval for E-commerce Search with Semantic Cluster IDs and Expert-Guided RL

Arxiv

0+阅读 · 5月14日

Cellwise and Casewise Robust Multivariate Regression with Inference

Arxiv

0+阅读 · 5月8日

DistributedEstimator: Distributed Training of Quantum Neural Networks via Circuit Cutting

Arxiv

0+阅读 · 5月5日

Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks

Arxiv

0+阅读 · 5月5日

Similarity and Matching of Neural Network Representations

Arxiv

10+阅读 · 2021年10月27日

Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks

Arxiv

36+阅读 · 2020年5月24日

相关基金

人Muse细胞诱导分化为神经前体细胞及功能性神经元并修复脊髓损伤

国家自然科学基金

0+阅读 · 2015年12月31日

力感应细胞源性外泌体及其microRNA货物介导的细胞间通讯在骨重建稳态中调控机制及干预策略的研究

国家自然科学基金

0+阅读 · 2015年12月31日

随机环境下多个体系统集体行为分析、调控与优化

国家自然科学基金

0+阅读 · 2015年12月31日

奇异耦合网络的动力学分析与控制

国家自然科学基金

0+阅读 · 2015年12月31日

部分同源片段间基因置换的研究及其可视化网络服务平台的建立

国家自然科学基金

0+阅读 · 2015年12月31日

双层网络下的振子集体行为研究：以生物钟神经元网络为例

国家自然科学基金

0+阅读 · 2015年12月31日

基于SCs促进OPCs存活、增殖和迁移的机制探讨SCI治疗的新策略

国家自然科学基金

0+阅读 · 2014年12月31日

新型多环Thienoacene衍生物及其共轭聚合物合成、表征与光伏器件阴极界面修饰研究

国家自然科学基金

0+阅读 · 2014年12月31日

环氧树脂基交联网络微观结构调控及其热致形状记忆构效关系

国家自然科学基金

0+阅读 · 2014年12月31日

基于肽类分子的多组分共组装：理性设计、多级调控与生物应用

国家自然科学基金

2+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员