Modern deep learning tools are remarkably effective in addressing intricate problems. However, their operation as black-box models introduces increased uncertainty in predictions. Additionally, they contend with various challenges, including the need for substantial storage space in large networks, issues of overfitting, underfitting, vanishing gradients, and more. This study explores the concept of Bayesian Neural Networks, presenting a novel architecture designed to significantly alleviate the storage space complexity of a network. Furthermore, we introduce an algorithm adept at efficiently handling uncertainties, ensuring robust convergence values without becoming trapped in local optima, particularly when the objective function lacks perfect convexity.
翻译:现代深度学习工具在处理复杂问题方面表现出色。然而,其作为黑箱模型的运作方式增加了预测中的不确定性。此外,它们还面临诸多挑战,包括大型网络需要大量存储空间、过拟合、欠拟合、梯度消失等问题。本研究探讨了贝叶斯神经网络的概念,提出了一种旨在显著降低网络存储空间复杂度的新型架构。同时,我们引入了一种能够高效处理不确定性的算法,确保在目标函数不完全凸时,既能获得稳健的收敛值,又不会陷入局部最优。