A Unified Framework to Enforce, Discover, and Promote Symmetry in Machine Learning

Symmetry is present throughout nature and continues to play an increasingly central role in physics and machine learning. Fundamental symmetries, such as Poincar\'{e} invariance, allow physical laws discovered in laboratories on Earth to be extrapolated to the farthest reaches of the universe. Symmetry is essential to achieving this extrapolatory power in machine learning applications. For example, translation invariance in image classification allows models with fewer parameters, such as convolutional neural networks, to be trained on smaller data sets and achieve state-of-the-art performance. In this paper, we provide a unifying theoretical and methodological framework for incorporating symmetry into machine learning models in three ways: 1. enforcing known symmetry when training a model; 2. discovering unknown symmetries of a given model or data set; and 3. promoting symmetry during training by learning a model that breaks symmetries within a user-specified group of candidates when there is sufficient evidence in the data. We show that these tasks can be cast within a common mathematical framework whose central object is the Lie derivative associated with fiber-linear Lie group actions on vector bundles. We extend and unify several existing results by showing that enforcing and discovering symmetry are linear-algebraic tasks that are dual with respect to the bilinear structure of the Lie derivative. We also propose a novel way to promote symmetry by introducing a class of convex regularization functions based on the Lie derivative and nuclear norm relaxation to penalize symmetry breaking during training of machine learning models. We explain how these ideas can be applied to a wide range of machine learning models including basis function regression, dynamical systems discovery, multilayer perceptrons, and neural networks acting on spatial fields such as images.

翻译：对称性普遍存在于自然界中，并且在物理学和机器学习中持续扮演日益核心的角色。基本对称性，例如庞加莱不变性，使得在地球实验室中发现的物理定律能够被外推到宇宙最遥远的角落。对称性对于在机器学习应用中实现这种外推能力至关重要。例如，图像分类中的平移不变性允许使用参数更少的模型（如卷积神经网络）在更小的数据集上进行训练，同时达到最先进的性能。在本文中，我们提供了一个统一的理论与方法学框架，通过三种方式将对称性融入机器学习模型：1. 在训练模型时强制已知对称性；2. 发现给定模型或数据集的未知对称性；3. 在训练过程中通过学习一个模型来促进对称性，该模型在数据中有足够证据时，会打破用户指定的候选对称性群组内的对称性。我们证明这些任务可以被纳入一个共同的数学框架之中，该框架的核心对象是与向量丛上的纤维线性李群作用相关联的李导数。我们通过展示强制和发现对称性是与李导数的双线性结构对偶的线性代数任务，来扩展并统一了若干现有结果。我们还提出了一种新颖的对称性促进方法，通过引入一类基于李导数和核范数松弛的凸正则化函数，在机器学习模型训练过程中惩罚对称性破缺。我们解释了如何将这些思想应用于广泛的机器学习模型，包括基函数回归、动力系统发现、多层感知器以及作用于图像等空间场上的神经网络。

相关内容

Machine Learning

关注 2251

机器学习（Machine Learning）是一个研究计算学习方法的国际论坛。该杂志发表文章，报告广泛的学习方法应用于各种学习问题的实质性结果。该杂志的特色论文描述研究的问题和方法，应用研究和研究方法的问题。有关学习问题或方法的论文通过实证研究、理论分析或与心理现象的比较提供了坚实的支持。应用论文展示了如何应用学习方法来解决重要的应用问题。研究方法论文改进了机器学习的研究方法。所有的论文都以其他研究人员可以验证或复制的方式描述了支持证据。论文还详细说明了学习的组成部分，并讨论了关于知识表示和性能任务的假设。官网地址：http://dblp.uni-trier.de/db/journals/ml/

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

Query2box: 使用盒嵌入对向量空间中的知识图谱进行推理，Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings

专知会员服务

46+阅读 · 2020年5月11日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日