Deep Operator Network Approximation Rates for Lipschitz Operators

We establish universality and expression rate bounds for a class of neural Deep Operator Networks (DON) emulating Lipschitz (or H\"older) continuous maps $\mathcal G:\mathcal X\to\mathcal Y$ between (subsets of) separable Hilbert spaces $\mathcal X$, $\mathcal Y$. The DON architecture considered uses linear encoders $\mathcal E$ and decoders $\mathcal D$ via (biorthogonal) Riesz bases of $\mathcal X$, $\mathcal Y$, and an approximator network of an infinite-dimensional, parametric coordinate map that is Lipschitz continuous on the sequence space $\ell^2(\mathbb N)$. Unlike previous works ([Herrmann, Schwab and Zech: Neural and Spectral operator surrogates: construction and expression rate bounds, SAM Report, 2022], [Marcati and Schwab: Exponential Convergence of Deep Operator Networks for Elliptic Partial Differential Equations, SAM Report, 2022]), which required for example $\mathcal G$ to be holomorphic, the present expression rate results require mere Lipschitz (or H\"older) continuity of $\mathcal G$. Key in the proof of the present expression rate bounds is the use of either super-expressive activations (e.g. [Yarotski: Elementary superexpressive activations, Int. Conf. on ML, 2021], [Shen, Yang and Zhang: Neural network approximation: Three hidden layers are enough, Neural Networks, 2021], and the references there) which are inspired by the Kolmogorov superposition theorem, or of nonstandard NN architectures with standard (ReLU) activations as recently proposed in [Zhang, Shen and Yang: Neural Network Architecture Beyond Width and Depth, Adv. in Neural Inf. Proc. Sys., 2022]. We illustrate the abstract results by approximation rate bounds for emulation of a) solution operators for parametric elliptic variational inequalities, and b) Lipschitz maps of Hilbert-Schmidt operators.

翻译：本文建立了一类神经深度算子网络（DON）对可分离希尔伯特空间$\mathcal X$，$\mathcal Y$（子集）之间的Lipschitz（或Hölder）连续映射$\mathcal G:\mathcal X\to\mathcal Y$进行模拟的普适性和表示速率界。所考虑的DON架构使用线性编码器$\mathcal E$和解码器$\mathcal D$（通过$\mathcal X$、$\mathcal Y$的双正交Riesz基），以及一个无限维参数坐标映射的逼近网络，该映射在序列空间$\ell^2(\mathbb N)$上是Lipschitz连续的。与先前需要$\mathcal G$为全纯映射的工作（[Herrmann, Schwab and Zech: Neural and Spectral operator surrogates: construction and expression rate bounds, SAM Report, 2022]，[Marcati and Schwab: Exponential Convergence of Deep Operator Networks for Elliptic Partial Differential Equations, SAM Report, 2022]）不同，本文的表示速率结果仅要求$\mathcal G$具有Lipschitz（或Hölder）连续性。证明当前表示速率界的关键在于使用超级表达激活函数（例如[Yarotski: Elementary superexpressive activations, Int. Conf. on ML, 2021]，[Shen, Yang and Zhang: Neural network approximation: Three hidden layers are enough, Neural Networks, 2021]及其参考文献），其灵感来源于Kolmogorov叠加定理，或使用近期[Zhang, Shen and Yang: Neural Network Architecture Beyond Width and Depth, Adv. in Neural Inf. Proc. Sys., 2022]提出的具有标准（ReLU）激活函数的非标准神经网络架构。我们通过以下模拟的逼近速率界来阐述抽象结果：a) 参数椭圆变分不等式的解算子，b) Hilbert-Schmidt算子的Lipschitz映射。