Activation functions introduce nonlinearity into deep neural networks. Most popular activation functions let positive values pass through while blocking or suppressing negative values. Starting from the idea that positive and negative values are equally important and must compete for activation, we propose a new Competition-based Adaptive ReLU (CAReLU). CAReLU scales the input based on the outcome of the competition between positive and negative values. It introduces two parameters that adjust the scaling strategy and can be trained jointly with the other network parameters. We verify the effectiveness of CAReLU on image classification, super-resolution, and natural language processing tasks. In our experiments, CAReLU outperforms other widely used activation functions. In particular, replacing ReLU with CAReLU in ResNet-18 improves classification accuracy on the CIFAR-100 dataset. Its effectiveness and the new perspective it offers on exploiting the competition between positive and negative values make CAReLU a promising activation function.
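The abstract does not give CAReLU's exact formula, but the core idea, a competition statistic computed from the positive and negative parts of the input that then rescales the whole input, can be illustrated with a minimal sketch. Everything below is an assumption for illustration only: the competition result is taken as the share of "energy" held by the positive entries, and `alpha`/`beta` stand in for the two trainable parameters mentioned in the abstract.

```python
import numpy as np

def carelu_sketch(x, alpha=1.0, beta=0.5):
    """Hypothetical illustration of competition-based scaling.

    Not the paper's actual definition: positive and negative entries
    "compete" through their squared magnitudes, and the resulting
    statistic in [0, 1] scales the whole input. alpha and beta play
    the role of the two parameters trained with the network.
    """
    pos_energy = np.sum(np.maximum(x, 0.0) ** 2)   # energy of positive part
    total_energy = np.sum(x ** 2) + 1e-12          # avoid division by zero
    c = pos_energy / total_energy                  # competition result in [0, 1]
    return (alpha * c + beta) * x                  # scale input by the outcome

x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
y = carelu_sketch(x)
```

Unlike ReLU, this kind of scaling keeps negative values in play: they influence the competition statistic and are passed through (rescaled) rather than zeroed out, which matches the abstract's premise that both signs matter.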