Extraction of structure, in particular of group symmetries, is increasingly crucial to understanding and building intelligent models. Notably, some information-theoretic models of parsimonious learning have been argued to induce invariance extraction. Here, we formalise these arguments from a group-theoretic perspective. We then extend them to the study of more general probabilistic symmetries, through compressions preserving well-studied geometric measures of complexity. More precisely, we formalise a trade-off between compression and preservation of the divergence from a given hierarchical model, yielding a novel generalisation of the Information Bottleneck framework. Through appropriate choices of hierarchical models, we fully characterise (in the discrete and full-support case) channel invariance, channel equivariance and distribution invariance under permutation. Allowing imperfect divergence preservation then leads to principled definitions of "soft symmetries", where the "coarseness" corresponds to the degree of compression of the system. In simple synthetic experiments, we demonstrate that our method successively recovers, at increasingly compressed "resolutions", nested but increasingly perturbed equivariances, where new equivariances emerge at bifurcation points of the trade-off parameter. Our framework suggests a new path for the extraction of generalised probabilistic symmetries.
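As context for the trade-off described above, the classical Information Bottleneck that this work generalises balances compression of a source $X$ into a representation $T$ against preservation of information about a relevance variable $Y$, via the Lagrangian $I(X;T) - \beta\, I(T;Y)$. The sketch below (an illustrative assumption, not the paper's generalised objective; function names and the discrete, full-support setting are ours) evaluates this objective for a given stochastic encoder:

```python
import numpy as np

def mutual_information(pxy):
    """I(X;Y) in nats for a joint distribution given as a 2-D array summing to 1."""
    px = pxy.sum(axis=1, keepdims=True)   # marginal p(x)
    py = pxy.sum(axis=0, keepdims=True)   # marginal p(y)
    mask = pxy > 0                        # skip zero cells (0 log 0 = 0)
    return float(np.sum(pxy[mask] * np.log(pxy[mask] / (px @ py)[mask])))

def ib_objective(pxy, q_t_given_x, beta):
    """Classical IB Lagrangian I(X;T) - beta * I(T;Y) for an encoder q(t|x).

    pxy:        joint p(x, y), shape (|X|, |Y|)
    q_t_given_x: rows are conditional distributions q(t|x), shape (|X|, |T|)
    """
    px = pxy.sum(axis=1)                  # p(x)
    pxt = px[:, None] * q_t_given_x       # joint p(x, t) = p(x) q(t|x)
    pty = q_t_given_x.T @ pxy             # joint p(t, y) via the Markov chain T - X - Y
    return mutual_information(pxt) - beta * mutual_information(pty)
```

The trade-off parameter $\beta$ here plays the role the abstract assigns to its bifurcation parameter: sweeping it moves the solution between maximal compression and maximal divergence preservation.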