Enhancing the Scalability and Applicability of Kohn-Sham Hamiltonians for Molecular Systems

Density Functional Theory (DFT) is a pivotal method within quantum chemistry and materials science, with its core involving the construction and solution of the Kohn-Sham Hamiltonian. Despite its importance, the application of DFT is frequently limited by the substantial computational resources required to construct the Kohn-Sham Hamiltonian. In response to these limitations, current research has employed deep-learning models to efficiently predict molecular and solid Hamiltonians, with roto-translational symmetries encoded in their neural networks. However, the scalability of prior models may be problematic when applied to large molecules, resulting in non-physical predictions of ground-state properties. In this study, we generate a substantially larger training set (PubChemQH) than used previously and use it to create a scalable model for DFT calculations with physical accuracy. For our model, we introduce a loss function derived from physical principles, which we call Wavefunction Alignment Loss (WALoss). WALoss involves performing a basis change on the predicted Hamiltonian to align it with the observed one; thus, the resulting differences can serve as a surrogate for orbital energy differences, allowing models to make better predictions for molecular orbitals and total energies than previously possible. WALoss also substantially accelerates self-consistent-field (SCF) DFT calculations. Here, we show it achieves a reduction in total energy prediction error by a factor of 1347 and an SCF calculation speed-up by a factor of 18%. These substantial improvements set new benchmarks for achieving accurate and applicable predictions in larger molecular systems.

翻译：密度泛函理论（DFT）是量子化学和材料科学中的关键方法，其核心在于构建并求解Kohn-Sham哈密顿量。尽管DFT至关重要，但其应用常受限于构建Kohn-Sham哈密顿量所需的大量计算资源。针对这些限制，当前研究已采用深度学习模型来高效预测分子与固体哈密顿量，并在神经网络中编码了旋转平移对称性。然而，现有模型的可扩展性在应用于大分子时可能存在问题，导致对基态性质的非物理预测。在本研究中，我们生成了一个比以往所用数据集大得多的训练集（PubChemQH），并利用它创建了一个具有物理精度的、可扩展的DFT计算模型。对于我们的模型，我们引入了一种基于物理原理推导的损失函数，称之为波函数对齐损失（WALoss）。WALoss涉及对预测的哈密顿量进行基变换以使其与观测值对齐；由此产生的差异可作为轨道能量差异的替代度量，使模型能够比以往更准确地预测分子轨道和总能量。WALoss还显著加速了自洽场（SCF）DFT计算。本文表明，该模型将总能量预测误差降低了1347倍，并将SCF计算速度提升了18%。这些显著改进为在更大分子系统中实现精确且适用的预测设立了新的基准。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日