Machine-learned interatomic potentials (MLIPs) are increasingly used to replace computationally demanding electronic-structure calculations to model matter at the atomic scale. The most commonly used model architectures are constrained to fulfill a number of physical laws exactly, from geometric symmetries to energy conservation. Evidence is mounting that relaxing some of these constraints can be beneficial to the efficiency and (somewhat surprisingly) accuracy of MLIPs, even though care should be taken to avoid qualitative failures associated with the breaking of physical symmetries. Given the recent trend of scaling up models to larger numbers of parameters and training samples, a very important question is how unconstrained MLIPs behave in this limit. Here we investigate this issue, showing that -- when trained on large datasets -- unconstrained models can be superior in accuracy and speed when compared to physically constrained models. We assess these models both in terms of benchmark accuracy and in terms of usability in practical scenarios, focusing on static simulation workflows such as geometry optimization and lattice dynamics. We conclude that accurate unconstrained models can be applied with confidence, especially since simple inference-time modifications can be used to recover observables that are consistent with the relevant physical symmetries.
翻译:机器学习原子间势(MLIPs)正越来越多地被用于替代计算密集型的电子结构计算,以在原子尺度上模拟物质。最常用的模型架构被约束为严格满足一系列物理定律,从几何对称性到能量守恒。越来越多的证据表明,放松其中一些约束有助于提升MLIPs的效率,并且(某种程度上令人惊讶地)精度,尽管需要注意避免因破坏物理对称性而导致的定性失败。鉴于近期模型向更大参数数量及更多训练样本扩展的趋势,一个至关重要的问题是:无约束MLIPs在这种极限条件下的表现如何。本文针对此问题展开研究,表明——当在大规模数据集上训练时——与物理约束模型相比,无约束模型在精度和速度上均可能更优。我们从基准精度以及实际场景中的可用性两个方面评估这些模型,重点关注静态模拟工作流,例如几何优化和晶格动力学。我们得出结论:准确的无约束模型可以放心应用,尤其是因为可以通过简单的推理时修正来获得与相关物理对称性一致的可观测量。