Bayesian optimized deep ensemble for uncertainty quantification of deep neural networks: a system safety case study on sodium fast reactor thermal stratification modeling

Neural Networks · 集成 · 优化器 · CASE · FAST ·

2024 年 12 月 11 日

翻译：贝叶斯优化的深度集成用于深度神经网络不确定性量化：钠冷快堆热分层建模的系统安全案例研究

Zaid Abulawi,Rui Hu,Prasanna Balaprakash,Yang Liu

Accurate predictions and uncertainty quantification (UQ) are essential for decision-making in risk-sensitive fields such as system safety modeling. Deep ensembles (DEs) are efficient and scalable methods for UQ in Deep Neural Networks (DNNs); however, their performance is limited when constructed by simply retraining the same DNN multiple times with randomly sampled initializations. To overcome this limitation, we propose a novel method that combines Bayesian optimization (BO) with DE, referred to as BODE, to enhance both predictive accuracy and UQ. We apply BODE to a case study involving a Densely connected Convolutional Neural Network (DCNN) trained on computational fluid dynamics (CFD) data to predict eddy viscosity in sodium fast reactor thermal stratification modeling. Compared to a manually tuned baseline ensemble, BODE estimates total uncertainty approximately four times lower in a noise-free environment, primarily due to the baseline's overestimation of aleatoric uncertainty. Specifically, BODE estimates aleatoric uncertainty close to zero, while aleatoric uncertainty dominates the total uncertainty in the baseline ensemble. We also observe a reduction of more than 30% in epistemic uncertainty. When Gaussian noise with standard deviations of 5% and 10% is introduced into the data, BODE accurately fits the data and estimates uncertainty that aligns with the data noise. These results demonstrate that BODE effectively reduces uncertainty and enhances predictions in data-driven models, making it a flexible approach for various applications requiring accurate predictions and robust UQ.

翻译：在系统安全建模等风险敏感领域中，准确的预测与不确定性量化对决策至关重要。深度集成是深度神经网络中进行不确定性量化的一种高效且可扩展的方法；然而，当仅通过随机采样初始化多次重训练同一深度神经网络来构建时，其性能会受到限制。为克服此限制，我们提出一种将贝叶斯优化与深度集成相结合的新方法，称为BODE，以同时提升预测准确性与不确定性量化效果。我们将BODE应用于一个案例研究，该研究使用在计算流体动力学数据上训练的密集连接卷积神经网络来预测钠冷快堆热分层建模中的涡粘性。与手动调参的基线集成相比，在无噪声环境中，BODE估计的总不确定性降低了约四倍，这主要归因于基线方法高估了偶然不确定性。具体而言，BODE估计的偶然不确定性接近于零，而基线集成中的总不确定性主要由偶然不确定性主导。我们还观察到认知不确定性降低了30%以上。当向数据中引入标准差为5%和10%的高斯噪声时，BODE能准确拟合数据并估计出与数据噪声一致的不确定性。这些结果表明，BODE能有效降低数据驱动模型中的不确定性并提升预测性能，使其成为需要准确预测和鲁棒不确定性量化的各种应用的一种灵活方法。

相关内容

Neural Networks

关注 1654

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

《用于无线通信和传感的智能反射面 (IRS)》（ICC 2022）新加坡国立大学2022最新53页slides

专知会员服务

26+阅读 · 2022年11月16日

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日