Probing the effects of broken symmetries in machine learning

Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivariant to translations, permutations, and rotations of the atoms. Incorporating symmetries -- rotations in particular -- constrains the model design space and implies more complicated architectures that are often also computationally demanding. There are indications that non-symmetric models can easily learn symmetries from data, and that doing so can even be beneficial for the accuracy of the model. We put a model that obeys rotational invariance only approximately to the test, in realistic scenarios involving simulations of gas-phase, liquid, and solid water. We focus specifically on physical observables that are likely to be affected -- directly or indirectly -- by symmetry breaking, finding negligible consequences when the model is used in an interpolative, bulk, regime. Even for extrapolative gas-phase predictions, the model remains very stable, even though symmetry artifacts are noticeable. We also discuss strategies that can be used to systematically reduce the magnitude of symmetry breaking when it occurs, and assess their impact on the convergence of observables.

翻译：对称性是物理学中最核心的概念之一，因此它被广泛采纳为应用于物理科学的机器学习模型的归纳偏置也就不足为奇了。对于针对原子尺度物质性质的模型而言尤其如此。无论是已确立的方法还是最先进的方法，几乎无一例外，都被构建为对原子的平移、置换和旋转具有精确的等变性。融入对称性——尤其是旋转对称性——限制了模型的设计空间，并通常意味着更复杂且计算要求更高的架构。有迹象表明，非对称模型可以轻松地从数据中学习对称性，而且这样做甚至可能有利于模型的准确性。我们在涉及气相、液态和固态水模拟的现实场景中，对一个仅近似服从旋转不变性的模型进行了测试。我们特别关注那些可能直接或间接受对称性破缺影响的物理可观测量，发现当模型用于内插性的体相体系时，其影响微乎其微。即使对于外推性的气相预测，模型也保持非常稳定，尽管对称性伪影是明显的。我们还讨论了可用于系统性地减小对称性破缺发生时其影响幅度的策略，并评估了这些策略对可观测量收敛性的影响。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【AI应用】Facebook-利用神经网络求解高等数学方程, Using neural networks to solve advanced mathematics equations

专知会员服务

34+阅读 · 2020年1月15日