We propose a neural network model for contextual regression in which the regression model depends on contextual features that determine the active submodel and an algorithm to fit the model. The proposed simple contextual neural network (SCtxtNN) separates context identification from context-specific regression, resulting in a structured and interpretable architecture with fewer parameters than a fully connected feed-forward network. We show mathematically that the proposed architecture is sufficient to represent contextual linear regression models using only standard neural network components. Numerical experiments are provided to support the theoretical result, showing that the proposed model achieves lower excess mean squared error and more stable performance than feed-forward neural networks with comparable numbers of parameters, while larger networks improve accuracy only at the cost of increased complexity. The results suggest that incorporating contextual structure can improve model efficiency while preserving interpretability.
翻译:我们提出了一种用于上下文回归的神经网络模型,其中回归模型依赖于决定活跃子模型的上下文特征,并配套开发了相应的模型拟合算法。所提出的简单上下文神经网络(SCtxtNN)将上下文识别与特定上下文回归相分离,从而形成了一种结构清晰、可解释性强的架构,其参数数量少于全连接前馈网络。我们从数学上证明,该架构仅使用标准神经网络组件就足以表示上下文线性回归模型。数值实验验证了理论结果,表明与参数数量相当的前馈神经网络相比,所提出的模型实现了更低的超额均方误差和更稳定的性能,而更大的网络仅能以增加复杂度为代价来提高精度。研究结果表明,融入上下文结构可以在保持可解释性的同时提升模型效率。