This paper investigates the universal approximation capabilities of Hamiltonian Deep Neural Networks (HDNNs) that arise from the discretization of Hamiltonian Neural Ordinary Differential Equations. Recently, it has been shown that HDNNs enjoy, by design, non-vanishing gradients, which provide numerical stability during training. However, although HDNNs have demonstrated state-of-the-art performance in several applications, a comprehensive study to quantify their expressivity is missing. In this regard, we provide a universal approximation theorem for HDNNs and prove that a portion of the flow of HDNNs can approximate arbitrary well any continuous function over a compact domain. This result provides a solid theoretical foundation for the practical use of HDNNs.
翻译:本文研究了由哈密顿神经常微分方程离散化产生的哈密顿深度神经网络(HDNNs)的通用逼近能力。近期研究表明,HDNNs 通过设计具有非消失梯度特性,从而在训练过程中提供数值稳定性。然而,尽管 HDNNs 已在多项应用中展现出最先进的性能,但目前尚缺乏对其表达能力的系统性量化研究。针对这一问题,我们提出了 HDNNs 的通用逼近定理,证明 HDNNs 的流部分可以在紧致域上以任意精度逼近任意连续函数。该结果为 HDNNs 的实际应用提供了坚实的理论基础。