We propose nonuniform, data-driven parameter distributions for neural network initialization, based on derivative data of the function to be approximated. These parameter distributions are developed in the context of non-parametric regression models built on shallow neural networks, and they compare favorably to well-established uniform random feature models based on conventional weight initialization. We address the cases of Heaviside and ReLU activation functions, as well as their smooth approximations (sigmoid and softplus), and draw on recent results on the harmonic analysis and sparse representation of fully trained optimal neural networks. Extending analytic results that give exact representations, we obtain densities that concentrate in regions of the parameter space corresponding to neurons well suited to model the local derivatives of the unknown function. Based on these results, we propose simplifications of these exact densities that use approximate derivative data at the input points, allow for very efficient sampling, and bring the performance of random feature models close to that of optimal networks in several scenarios.
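The overall pipeline can be illustrated with a minimal one-dimensional numpy sketch: estimate derivative data of the target by finite differences, sample ReLU breakpoints (biases) from a density concentrated where the function curves, and fit only the linear output layer by least squares. This is a simplified stand-in for the paper's exact densities; the target function, feature count, and sampling rule are illustrative assumptions, not the proposed method itself.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative target with localized curvature (an assumption, not from the paper).
f = lambda x: np.tanh(20.0 * (x - 0.5))

# Training data on [0, 1].
x = np.linspace(0.0, 1.0, 400)
y = f(x)

# Approximate second-derivative magnitudes via finite differences; these play
# the role of the approximate derivative data that the densities concentrate on.
d2 = np.abs(np.gradient(np.gradient(y, x), x))
p = d2 / d2.sum()

# Sample ReLU breakpoints from the data-driven density, with random orientations.
m = 50
b = rng.choice(x, size=m, p=p)
s = rng.choice([-1.0, 1.0], size=m)

# Random feature matrix: phi_i(x) = relu(s_i * (x - b_i)), plus a bias column.
Phi = np.maximum(s[None, :] * (x[:, None] - b[None, :]), 0.0)
Phi = np.hstack([Phi, np.ones((x.size, 1))])

# Only the outer (linear) weights are fitted, as in a random feature model.
w, *_ = np.linalg.lstsq(Phi, y, rcond=None)
rmse = np.sqrt(np.mean((Phi @ w - y) ** 2))
```

Because the breakpoints cluster near the region of large curvature, far fewer features are wasted on flat regions than with uniform sampling of the biases.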