Weaves, Wires, and Morphisms: Formalizing and Implementing the Algebra of Deep Learning - 专知论文

会员服务 ·

0

形式化 · 数学 · 学习模型 · 深度学习模型 · 深度学习 ·

Weaves, Wires, and Morphisms: Formalizing and Implementing the Algebra of Deep Learning

翻译：《编织、连线与态射：深度学习代数的形式化与实现》

Vincent Abbott,Gioele Zardini

Despite deep learning models running well-defined mathematical functions, we lack a formal mathematical framework for describing model architectures. Ad-hoc notation, diagrams, and pseudocode poorly handle nonlinear broadcasting and the relationship between individual components and composed models. This paper introduces a categorical framework for deep learning models that formalizes broadcasting through the novel axis-stride and array-broadcasted categories. This allows the mathematical function underlying architectures to be precisely expressed and manipulated in a compositional manner. These mathematical definitions are translated into human manageable diagrams and machine manageable data structures. We provide a mirrored implementation in Python (pyncd) and TypeScript (tsncd) to show the universal aspect of our framework, along with features including algebraic construction, graph conversion, PyTorch compilation and diagram rendering. This lays the foundation for a systematic, formal approach to deep learning model design and analysis.

翻译：尽管深度学习模型运行着定义明确的数学函数，但我们仍缺乏描述模型架构的形式化数学框架。临时性的符号、示意图和伪代码难以妥善处理非线性广播以及单个组件与组合模型之间的关系。本文引入了一个面向深度学习模型的范畴论框架，通过新颖的轴步幅范畴和数组广播范畴对广播操作进行形式化。这使得架构背后的数学函数能够以组合方式被精确表达和操控。这些数学定义被转化为人类可理解的示意图和机器可管理的数据结构。我们提供了Python (pyncd) 和TypeScript (tsncd) 中的镜像实现，以展示框架的普适性，附带功能包括代数构建、图转换、PyTorch编译和图表渲染。这为深度学习模型设计与分析的系统化形式化方法奠定了基础。

0

相关内容

形式化

【新书】深度学习的数学和架构，552页pdf

【新书】深度学习的数学和架构，552页pdf

专知会员服务

157+阅读 · 2024年4月25日

【干货书】深度学习的数学导论:方法、实现和理论，601页pdf

【干货书】深度学习的数学导论:方法、实现和理论，601页pdf

专知会员服务

118+阅读 · 2024年1月23日

数学推导详解DL理论！普林斯顿最新127页pdf《深度学习理论》简明书，带你理解深度学习优化、泛化等

数学推导详解DL理论！普林斯顿最新127页pdf《深度学习理论》简明书，带你理解深度学习优化、泛化等

专知会员服务

150+阅读 · 2022年8月29日

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

专知会员服务

197+阅读 · 2022年4月24日

【2024新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

【2024新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

专知会员服务

154+阅读 · 2022年4月11日

【Yoshua Bengio经典书】人工智能中的深度结构学习，130页pdf

【Yoshua Bengio经典书】人工智能中的深度结构学习，130页pdf

专知会员服务

45+阅读 · 2021年6月6日

78页最新「深度学习现代数学」大综述论文，数学分析深度学习为何成功的理论

专知会员服务

109+阅读 · 2021年5月15日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

【深度图相似学习综述】Deep Graph Similarity Learning: A Survey，29页pdf，117条参考文献

【深度图相似学习综述】Deep Graph Similarity Learning: A Survey，29页pdf，117条参考文献

专知会员服务

98+阅读 · 2019年12月31日

【干货】深度学习的深度思考，49页pdf，Deep Thoughts on Deep Learning

【干货】深度学习的深度思考，49页pdf，Deep Thoughts on Deep Learning

专知会员服务

30+阅读 · 2019年11月14日

【MIT博士论文】深度学习几何表示，138页pdf

【MIT博士论文】深度学习几何表示，138页pdf

专知

18+阅读 · 2022年9月4日

【干货书】机器学习线性代数与优化，507页pdf

【干货书】机器学习线性代数与优化，507页pdf

专知

23+阅读 · 2022年7月28日

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

专知

36+阅读 · 2022年4月24日

【2022新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

【2022新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

专知

29+阅读 · 2022年4月12日

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知

11+阅读 · 2020年7月15日

深度多模态表示学习综述论文，22页pdf

深度多模态表示学习综述论文，22页pdf

专知

33+阅读 · 2020年6月21日

多模态深度学习综述，18页pdf

多模态深度学习综述，18页pdf

专知

51+阅读 · 2020年3月29日

新书《用于计算机视觉、机器人和机器学习的线性代数》，附753页PDF下载

新书《用于计算机视觉、机器人和机器学习的线性代数》，附753页PDF下载

专知

48+阅读 · 2019年11月28日

那些值得推荐和收藏的线性代数学习资源

那些值得推荐和收藏的线性代数学习资源

AINLP

25+阅读 · 2019年3月6日

【干货】深度学习中的线性代数

【干货】深度学习中的线性代数

专知

21+阅读 · 2018年3月30日

复杂环境下机器学习的理论研究

国家自然科学基金

21+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于深度学习的复杂退化模糊图像恢复

国家自然科学基金

5+阅读 · 2015年12月31日

面向构建过程的范畴学习模型及其适应性机制研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于代数结构及公理语义的泛型约束方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

信息论学习中的正则化及相关高维数据分析方法的数学理论

国家自然科学基金

12+阅读 · 2014年12月31日

基于深度学习的三维模型检索技术

国家自然科学基金

13+阅读 · 2014年12月31日

强非线性偏微分方程基于梯度重构的新型算法

国家自然科学基金

0+阅读 · 2014年12月31日

分数阶偏微分方程与近场动力学等非局部模型的高保真快速算法与数值分析

国家自然科学基金

1+阅读 · 2014年12月31日

奇异线性方程组和具有特定结构的非线性问题的研究与应用

国家自然科学基金

0+阅读 · 2014年12月31日

InfiniteDiffusion: Bridging Learned Fidelity and Procedural Utility for Open-World Terrain Generation

Arxiv

0+阅读 · 5月3日

From Reachability to Learnability: Geometric Design Principles for Quantum Neural Networks

Arxiv

0+阅读 · 3月25日

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Arxiv

0+阅读 · 3月19日

Structured Kolmogorov-Arnold Neural ODEs for Interpretable Learning and Symbolic Discovery of Nonlinear Dynamics

Arxiv

0+阅读 · 3月5日

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models

Arxiv

14+阅读 · 2024年1月14日

A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

Arxiv

13+阅读 · 2023年11月2日

Deep Model Fusion: A Survey

Arxiv

14+阅读 · 2023年9月27日

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

Arxiv

232+阅读 · 2023年4月7日

Model Complexity of Deep Learning: A Survey

Arxiv

33+阅读 · 2021年3月8日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

VIP会员

文章信息

相关主题

深度学习模型

最新内容

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

专知会员服务

3+阅读 · 6月22日

综述 | 3D场景图：开放挑战与未来方向

综述 | 3D场景图：开放挑战与未来方向

专知会员服务

3+阅读 · 6月22日

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

专知会员服务

3+阅读 · 6月22日

21世纪的无人机战争

21世纪的无人机战争

专知会员服务

3+阅读 · 6月22日

《伊朗与以色列-美国热战及其对数字技术的影响》

《伊朗与以色列-美国热战及其对数字技术的影响》

专知会员服务

3+阅读 · 6月22日

《量子技术的军事任务技术适配与利用》

《量子技术的军事任务技术适配与利用》

专知会员服务

3+阅读 · 6月22日

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

《美国陆军军官学校（西点军校）本科生科研中生成式人工智能的使用》

专知会员服务

4+阅读 · 6月22日

美国从乌克兰无人机战争中学习经验

美国从乌克兰无人机战争中学习经验

专知会员服务

7+阅读 · 6月21日

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

ICML 2026 | 面向视觉语言模型的语义鲁棒性认证

专知会员服务

5+阅读 · 6月21日

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

综述 | 智能体电子设计自动化：从“交接有效性”重新理解Agentic EDA

专知会员服务

8+阅读 · 6月21日

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

深入解读 Palantir AIP：全球最具争议的人工智能平台究竟如何运作

专知会员服务

21+阅读 · 6月20日

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

ICML 2026 | 多任务贝叶斯上下文学习：让 Transformer 在测试时显式适应新先验

专知会员服务

5+阅读 · 6月19日

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

ACL 2026综述 | 大规模手语数据集：资源、基准与标注标准

专知会员服务

8+阅读 · 6月19日

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

ICML 2026 Spotlight | SmoothSMoE：解析稀疏 MoE 路由不连续

专知会员服务

7+阅读 · 6月18日

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

综述 | 周期表视角下的大模型推理：范式、方法与失败模式

专知会员服务

9+阅读 · 6月18日

相关VIP内容

【新书】深度学习的数学和架构，552页pdf

【新书】深度学习的数学和架构，552页pdf

专知会员服务

157+阅读 · 2024年4月25日

【干货书】深度学习的数学导论:方法、实现和理论，601页pdf

【干货书】深度学习的数学导论:方法、实现和理论，601页pdf

专知会员服务

118+阅读 · 2024年1月23日

数学推导详解DL理论！普林斯顿最新127页pdf《深度学习理论》简明书，带你理解深度学习优化、泛化等

数学推导详解DL理论！普林斯顿最新127页pdf《深度学习理论》简明书，带你理解深度学习优化、泛化等

专知会员服务

150+阅读 · 2022年8月29日

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

专知会员服务

197+阅读 · 2022年4月24日

【2024新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

【2024新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

专知会员服务

154+阅读 · 2022年4月11日

【Yoshua Bengio经典书】人工智能中的深度结构学习，130页pdf

【Yoshua Bengio经典书】人工智能中的深度结构学习，130页pdf

专知会员服务

45+阅读 · 2021年6月6日

78页最新「深度学习现代数学」大综述论文，数学分析深度学习为何成功的理论

专知会员服务

109+阅读 · 2021年5月15日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

【深度图相似学习综述】Deep Graph Similarity Learning: A Survey，29页pdf，117条参考文献

【深度图相似学习综述】Deep Graph Similarity Learning: A Survey，29页pdf，117条参考文献

专知会员服务

98+阅读 · 2019年12月31日

【干货】深度学习的深度思考，49页pdf，Deep Thoughts on Deep Learning

【干货】深度学习的深度思考，49页pdf，Deep Thoughts on Deep Learning

专知会员服务

30+阅读 · 2019年11月14日

热门VIP内容

开通专知VIP会员享更多权益服务

综述 | 3D场景图：开放挑战与未来方向

21世纪的无人机战争

ICML 2026 | 边界嵌入塑形：用自适应对比学习破解图结构纠缠

《国防工业6.0：全自主作战系统、量子-人工智能融合与新一代战略威慑》

相关资讯

【MIT博士论文】深度学习几何表示，138页pdf

【MIT博士论文】深度学习几何表示，138页pdf

专知

18+阅读 · 2022年9月4日

【干货书】机器学习线性代数与优化，507页pdf

【干货书】机器学习线性代数与优化，507页pdf

专知

23+阅读 · 2022年7月28日

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

【Manning新书】深度学习: 数学与算法模型，Inside Deep Learning，602页pdf

专知

36+阅读 · 2022年4月24日

【2022新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

【2022新书】深度学习的数学工程，The Mathematical Engineering of Deep Learning

专知

29+阅读 · 2022年4月12日

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

【开放书】深度学习导论，196页pdf，Introduction to Deep Learning

专知

11+阅读 · 2020年7月15日

深度多模态表示学习综述论文，22页pdf

深度多模态表示学习综述论文，22页pdf

专知

33+阅读 · 2020年6月21日

多模态深度学习综述，18页pdf

多模态深度学习综述，18页pdf

专知

51+阅读 · 2020年3月29日

新书《用于计算机视觉、机器人和机器学习的线性代数》，附753页PDF下载

新书《用于计算机视觉、机器人和机器学习的线性代数》，附753页PDF下载

专知

48+阅读 · 2019年11月28日

那些值得推荐和收藏的线性代数学习资源

那些值得推荐和收藏的线性代数学习资源

AINLP

25+阅读 · 2019年3月6日

【干货】深度学习中的线性代数

【干货】深度学习中的线性代数

专知

21+阅读 · 2018年3月30日

相关论文

InfiniteDiffusion: Bridging Learned Fidelity and Procedural Utility for Open-World Terrain Generation

Arxiv

0+阅读 · 5月3日

From Reachability to Learnability: Geometric Design Principles for Quantum Neural Networks

Arxiv

0+阅读 · 3月25日

Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Arxiv

0+阅读 · 3月19日

Structured Kolmogorov-Arnold Neural ODEs for Interpretable Learning and Symbolic Discovery of Nonlinear Dynamics

Arxiv

0+阅读 · 3月5日

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models

Arxiv

14+阅读 · 2024年1月14日

A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

Arxiv

13+阅读 · 2023年11月2日

Deep Model Fusion: A Survey

Arxiv

14+阅读 · 2023年9月27日

On Efficient Training of Large-Scale Deep Learning Models: A Literature Review

Arxiv

232+阅读 · 2023年4月7日

Model Complexity of Deep Learning: A Survey

Arxiv

33+阅读 · 2021年3月8日

A Wholistic View of Continual Learning with Deep Neural Networks: Forgotten Lessons and the Bridge to Active and Open World Learning

Arxiv

36+阅读 · 2020年9月3日

相关基金

复杂环境下机器学习的理论研究

国家自然科学基金

21+阅读 · 2015年12月31日

分布式有监督学习的学习理论

国家自然科学基金

17+阅读 · 2015年12月31日

基于深度学习的复杂退化模糊图像恢复

国家自然科学基金

5+阅读 · 2015年12月31日

面向构建过程的范畴学习模型及其适应性机制研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于代数结构及公理语义的泛型约束方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

信息论学习中的正则化及相关高维数据分析方法的数学理论

国家自然科学基金

12+阅读 · 2014年12月31日

基于深度学习的三维模型检索技术

国家自然科学基金

13+阅读 · 2014年12月31日

强非线性偏微分方程基于梯度重构的新型算法

国家自然科学基金

0+阅读 · 2014年12月31日

分数阶偏微分方程与近场动力学等非局部模型的高保真快速算法与数值分析

国家自然科学基金

1+阅读 · 2014年12月31日

奇异线性方程组和具有特定结构的非线性问题的研究与应用

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员