Extrapolation to complete basis-set limit in density-functional theory by quantile random-forest models



March 31, 2023



Daniel T. Speckhard, Christian Carbogno, Luca Ghiringhelli, Sven Lubeck, Matthias Scheffler, Claudia Draxl

The numerical precision of density-functional-theory (DFT) calculations depends on a variety of computational parameters, one of the most critical being the basis-set size. The ultimate precision is reached with an infinitely large basis set, i.e., in the limit of a complete basis set (CBS). Our aim in this work is to find a machine-learning model that extrapolates finite basis-size calculations to the CBS limit. We start with a data set of 63 binary solids investigated with two all-electron DFT codes, exciting and FHI-aims, which employ very different types of basis sets. A quantile-random-forest model is used to estimate the total-energy correction with respect to a fully converged calculation as a function of the basis-set size. The random-forest model achieves a symmetric mean absolute percentage error below 25% for both codes and outperforms previous approaches in the literature. Our approach also provides prediction intervals, which quantify the uncertainty of the models' predictions.
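To make the two technical ingredients of the abstract concrete, the following is a minimal sketch of a quantile random forest and of the symmetric mean absolute percentage error (SMAPE). It is not the authors' implementation. The data are synthetic (an exponentially decaying "energy correction" as a function of a single hypothetical `basis_size` feature), the SMAPE formula is one common definition and may differ from the one used in the paper, and the helper `forest_quantiles` is an illustrative name. The quantiles are obtained by weighting training targets according to how often they share a leaf with the query point, in the spirit of quantile regression forests, on top of scikit-learn's `RandomForestRegressor`.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor


def smape(y_true, y_pred):
    """SMAPE in percent: mean of |pred - true| / ((|true| + |pred|) / 2).

    One common definition (an assumption here); terms with a zero
    denominator contribute zero.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    denom = (np.abs(y_true) + np.abs(y_pred)) / 2.0
    ratio = np.divide(np.abs(y_pred - y_true), denom,
                      out=np.zeros_like(denom), where=denom > 0)
    return 100.0 * ratio.mean()


def forest_quantiles(forest, X_train, y_train, X_query, quantiles):
    """Estimate conditional quantiles from a fitted random forest.

    Each training target is weighted by how often it falls into the same
    leaf as the query point, averaged over trees; the quantile is read
    off the resulting weighted empirical distribution.
    """
    train_leaves = forest.apply(X_train)  # shape (n_train, n_trees)
    query_leaves = forest.apply(X_query)  # shape (n_query, n_trees)
    order = np.argsort(y_train)
    y_sorted = y_train[order]
    out = np.empty((len(X_query), len(quantiles)))
    for i, leaves in enumerate(query_leaves):
        same = train_leaves == leaves                    # leaf co-membership
        weights = (same / same.sum(axis=0)).mean(axis=1)  # sums to 1
        cdf = np.cumsum(weights[order])
        idx = np.searchsorted(cdf, quantiles)
        out[i] = y_sorted[np.clip(idx, 0, len(y_sorted) - 1)]
    return out


# Synthetic stand-in data: the correction shrinks as the basis set grows.
rng = np.random.default_rng(0)
basis_size = rng.uniform(1.0, 10.0, size=400)
correction = 2.0 * np.exp(-0.5 * basis_size) * (1.0 + 0.1 * rng.normal(size=400))
X = basis_size.reshape(-1, 1)
X_train, y_train = X[:300], correction[:300]
X_test, y_test = X[300:], correction[300:]

forest = RandomForestRegressor(n_estimators=100, min_samples_leaf=5,
                               random_state=0).fit(X_train, y_train)

# 5%, 50% and 95% quantiles give a median prediction and a 90% interval.
q = forest_quantiles(forest, X_train, y_train, X_test, [0.05, 0.5, 0.95])
lower, median, upper = q.T
coverage = np.mean((y_test >= lower) & (y_test <= upper))
print(f"SMAPE of median prediction: {smape(y_test, median):.1f}%")
print(f"Fraction of test points inside the interval: {coverage:.2f}")
```

The interval width `upper - lower` plays the role of the prediction interval mentioned in the abstract: it is wide where the training targets that share leaves with the query point are scattered, and narrow where they agree.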

