Neural networks are famously nonlinear. However, linearity is not an intrinsic property of a function: it is defined relative to the vector-space structures on the domain and codomain of $f:X \to Y$. Leveraging the algebraic concept of transport of structure, we propose a method to explicitly identify non-standard vector spaces on which a neural network acts as a linear operator. Sandwiching a linear operator $A$ between two invertible neural networks, $f(x)=g_y^{-1}(A\,g_x(x))$, induces vector-space structures on $X$ and $Y$ through newly defined addition and scaling operations transported by $g_x$ and $g_y$. We term this architecture a Linearizer. The framework makes the entire arsenal of linear algebra, including SVD, pseudo-inverse, orthogonal projection, and more, applicable to nonlinear mappings. Furthermore, we show that the composition of two Linearizers that share a neural network is itself a Linearizer. We leverage this property to show that training diffusion models with our architecture collapses hundreds of sampling steps into a single step. We further utilize our framework to enforce idempotency (i.e., $f(f(x))=f(x)$) on networks, yielding a globally projective generative model, and to demonstrate modular style transfer.
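A minimal sketch of the construction, under illustrative assumptions: the invertible networks $g_x$, $g_y$ are stood in for by simple elementwise bijections with closed-form inverses, and the helper names (`add_X`, `mul_X`, etc.) are hypothetical, not from the paper. The sketch checks that $f(x)=g_y^{-1}(A\,g_x(x))$ is linear with respect to the transported operations, and that composing two Linearizers sharing the middle bijection collapses to a single Linearizer with operator $BA$.

```python
import numpy as np

# Stand-ins for the invertible networks g_x and g_y (any bijections work;
# elementwise ones are used here so the inverses are exact in closed form).
g_x  = np.sinh                                    # X-side bijection
g_xi = np.arcsinh                                 # its inverse
g_y  = lambda v: np.sign(v) * v**2                # Y-side bijection (monotone on R)
g_yi = lambda u: np.sign(u) * np.sqrt(np.abs(u))  # its inverse

rng = np.random.default_rng(0)
A = rng.normal(size=(3, 3))                       # the sandwiched linear operator

def f(x):
    """Linearizer: f(x) = g_y^{-1}(A g_x(x))."""
    return g_yi(A @ g_x(x))

# Vector-space operations on X and Y transported through g_x and g_y.
def add_X(x1, x2): return g_xi(g_x(x1) + g_x(x2))   # induced addition on X
def mul_X(c, x):   return g_xi(c * g_x(x))          # induced scaling on X
def add_Y(y1, y2): return g_yi(g_y(y1) + g_y(y2))   # induced addition on Y
def mul_Y(c, y):   return g_yi(c * g_y(y))          # induced scaling on Y

# f is linear w.r.t. the induced structures:
# f(x1 +_X x2) = f(x1) +_Y f(x2)  and  f(c ._X x) = c ._Y f(x).
x1, x2, c = rng.normal(size=3), rng.normal(size=3), 1.7
assert np.allclose(f(add_X(x1, x2)), add_Y(f(x1), f(x2)))
assert np.allclose(f(mul_X(c, x1)), mul_Y(c, f(x1)))

# Two Linearizers sharing the middle bijection (g_y is the X-side network of
# the second one) compose into a single Linearizer with operator B @ A:
B = rng.normal(size=(3, 3))
g_z, g_zi = np.sinh, np.arcsinh                   # third bijection (illustrative)
f2 = lambda y: g_zi(B @ g_y(y))
assert np.allclose(f2(f(x1)), g_zi(B @ A @ g_x(x1)))
```

The composition identity is exact because the inner $g_y \circ g_y^{-1}$ cancels, which is the mechanism behind collapsing many sampling steps into one matrix product.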