Similarity-Distance-Magnitude Activations

Source: arXiv. Accepted to Findings of the Association for Computational Linguistics: ACL 2026. 21 pages, 8 tables, 1 algorithm. arXiv admin note: substantial text overlap with arXiv:2502.20167.

We introduce the Similarity-Distance-Magnitude (SDM) activation function, a more robust and interpretable formulation of the standard softmax activation function, adding Similarity (i.e., correctly predicted depth-matches into training) awareness and Distance-to-training-distribution awareness to the existing output Magnitude (i.e., decision-boundary) awareness, and enabling interpretability-by-exemplar via dense matching. We further introduce the SDM estimator, based on a data-driven partitioning of the class-wise empirical CDFs via the SDM activation, to control the class- and prediction-conditional accuracy among selective classifications. When used as the final-layer activation over pre-trained language models for selective classification, the SDM estimator is more robust to covariate shifts and out-of-distribution inputs than existing calibration methods using softmax activations, while remaining informative over in-distribution data.
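The abstract does not give the functional form of the SDM activation, so the following is only a minimal sketch of how Similarity and Distance signals could modulate a softmax over the output Magnitude. It assumes, hypothetically, that a similarity score `q >= 0` replaces the base `e` with `2 + q` and that a distance score `d` in `[0, 1]` rescales the logits. The names `sdm_like_activation`, `q`, and `d` are illustrative and are not taken from the text above.

```python
import math


def softmax(logits):
    """Standard softmax, shifted by the max logit for numerical stability."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sdm_like_activation(logits, q, d):
    """Hypothetical SDM-style activation (an assumption, not the paper's code).

    q: similarity score, q >= 0; larger q sharpens the distribution.
    d: distance score in [0, 1]; d near 0 (far from training data)
       flattens the distribution toward uniform.
    Computes (2 + q) ** (d * z_i) / sum_j (2 + q) ** (d * z_j).
    """
    if q < 0:
        raise ValueError("q must be non-negative")
    if not 0.0 <= d <= 1.0:
        raise ValueError("d must lie in [0, 1]")
    # (2 + q) ** (d * z) == exp(d * z * log(2 + q)), so reuse the stable softmax.
    log_base = math.log(2.0 + q)
    return softmax([d * z * log_base for z in logits])
```

Under these assumptions, setting `q = e - 2` and `d = 1` recovers the ordinary softmax, while `d = 0` yields a uniform distribution regardless of the logits, which is the kind of behavior one would want for inputs far from the training distribution.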

