ENN: A Neural Network with DCT Adaptive Activation Functions

from arxiv, Paper accepted in IEEE Journal of Selected Topics in Signal Processing (JSTSP) Special Series on AI in Signal & Data Science - Toward Explainable, Reliable, and Sustainable Machine Learning

The expressiveness of neural networks highly depends on the nature of the activation function, although these are usually assumed predefined and fixed during the training stage. Under a signal processing perspective, in this paper we present Expressive Neural Network (ENN), a novel model in which the non-linear activation functions are modeled using the Discrete Cosine Transform (DCT) and adapted using backpropagation during training. This parametrization keeps the number of trainable parameters low, is appropriate for gradient-based schemes, and adapts to different learning tasks. This is the first non-linear model for activation functions that relies on a signal processing perspective, providing high flexibility and expressiveness to the network. We contribute with insights in the explainability of the network at convergence by recovering the concept of bump, this is, the response of each activation function in the output space. Finally, through exhaustive experiments we show that the model can adapt to classification and regression tasks. The performance of ENN outperforms state of the art benchmarks, providing above a 40% gap in accuracy in some scenarios.

翻译：摘要：神经网络的表达能力高度依赖于激活函数的性质，尽管这些函数在训练阶段通常被预设为固定形式。本文从信号处理视角出发，提出了一种新颖的模型——表达性神经网络（ENN）。在该模型中，非线性激活函数通过离散余弦变换（DCT）进行建模，并在训练过程中利用反向传播进行自适应调整。这种参数化方法不仅保持了较低的可训练参数量，适用于基于梯度的优化方案，还能适应不同的学习任务。这是首个基于信号处理视角的激活函数非线性模型，为网络提供了高度的灵活性和表达能力。通过恢复"凸包"概念（即每个激活函数在输出空间中的响应），我们为网络收敛后的可解释性提供了深刻见解。最后，通过大量实验证明，该模型能够适应分类与回归任务。ENN的性能超越了当前最先进的基准模型，在某些场景下准确率提升超过40%。

相关内容

激活函数

关注 44

在人工神经网络中，给定一个输入或一组输入，节点的激活函数定义该节点的输出。一个标准集成电路可以看作是一个由激活函数组成的数字网络，根据输入的不同，激活函数可以是开(1)或关(0)。这类似于神经网络中的线性感知器的行为。然而，只有非线性激活函数允许这样的网络只使用少量的节点来计算重要问题，并且这样的激活函数被称为非线性。

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日