A Multi-In and Multi-Out Dendritic Neuron Model and its Optimization

Artificial neural networks (ANNs), inspired by the interconnection of real neurons, have achieved unprecedented success in various fields such as computer vision and natural language processing. Recently, a novel mathematical ANN model, known as the dendritic neuron model (DNM), has been proposed to address nonlinear problems by more accurately reflecting the structure of real neurons. However, the single-output design limits its capability to handle multi-output tasks, significantly lowering its applications. In this paper, we propose a novel multi-in and multi-out dendritic neuron model (MODN) to tackle multi-output tasks. Our core idea is to introduce a filtering matrix to the soma layer to adaptively select the desired dendrites to regress each output. Because such a matrix is designed to be learnable, MODN can explore the relationship between each dendrite and output to provide a better solution to downstream tasks. We also model a telodendron layer into MODN to simulate better the real neuron behavior. Importantly, MODN is a more general and unified framework that can be naturally specialized as the DNM by customizing the filtering matrix. To explore the optimization of MODN, we investigate both heuristic and gradient-based optimizers and introduce a 2-step training method for MODN. Extensive experimental results performed on 11 datasets on both binary and multi-class classification tasks demonstrate the effectiveness of MODN, with respect to accuracy, convergence, and generality.

翻译：人工神经网络（ANNs）受真实神经元互联结构的启发，在计算机视觉和自然语言处理等多个领域取得了前所未有的成功。近年来，一种称为树突神经元模型（DNM）的新型数学ANN模型被提出，其通过更精确地反映真实神经元结构来解决非线性问题。然而，单输出设计限制了其处理多输出任务的能力，显著降低了该模型的应用价值。本文提出一种新型多输入多输出树突神经元模型（MODN）以应对多输出任务。核心思想是在胞体层引入过滤矩阵，通过自适应方式选择所需树突来回归每个输出。由于该矩阵被设计为可学习的，MODN能够探索每个树突与输出之间的关系，从而为下游任务提供更优解决方案。我们还在MODN中构建了终树突层以更真实地模拟神经元行为。值得注意的是，MODN是一个更通用的统一框架，通过定制过滤矩阵可自然地特化为DNM。为探究MODN的优化方法，我们研究了启发式优化器和基于梯度的优化器，并提出了适用于MODN的两步训练法。在11个数据集上针对二分类和多分类任务开展的大量实验结果表明，MODN在精度、收敛性和泛化能力方面均展现出显著有效性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日