Exponential families encompass the distributions central to modern machine learning -- softmax, Gaussians, and Boltzmann distributions -- and underlie the theory of variational inference, entropy-regularized reinforcement learning, and RLHF. We isolate a simple identity for exponential families that expresses the KL difference $\mathrm{KL}(q \| p_{\lambda_2}) - \mathrm{KL}(q \| p_{\lambda_1})$ in terms of the log-partition function $A(\lambda)$ and the moment $\mu_q$. Remarkably, this identity together with the single fact that $\mathrm{KL} \geq 0$ (with equality iff $p = q$) suffices, by direct substitution and rearrangement, to derive a cluster of results that are classically obtained by separate, heavier arguments: a generalized three-point identity for arbitrary reference distributions, Pythagorean theorems for I-projections and reverse I-projections, convexity of the log-partition function, identification of its Legendre dual in KL terms, the Gibbs variational principle, and the explicit optimizer in KL-regularized reward maximization, including the exponential tilting formula underlying entropy-regularized control and RLHF. Beyond these purely algebraic consequences, standard analytic arguments recover the gradient formula for the log-partition function, the Bregman representation of within-family KL divergence, and the surjectivity of the moment map. The note is self-contained.
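The central identity, $\mathrm{KL}(q \| p_{\lambda_2}) - \mathrm{KL}(q \| p_{\lambda_1}) = A(\lambda_2) - A(\lambda_1) - \langle \lambda_2 - \lambda_1, \mu_q \rangle$, can be checked numerically on a finite exponential family $p_\lambda(x) \propto \exp(\langle \lambda, T(x) \rangle)$. A minimal sketch (the setup below, with its randomly drawn statistics and distributions, is illustrative and not from the note itself):

```python
import numpy as np

rng = np.random.default_rng(0)

# Finite exponential family over n outcomes with sufficient statistic T(x) in R^d:
# p_lam(x) = exp(lam . T(x) - A(lam)), uniform base measure.
n, d = 5, 3
T = rng.normal(size=(n, d))            # sufficient statistics, one row per outcome
lam1 = rng.normal(size=d)
lam2 = rng.normal(size=d)
q = rng.dirichlet(np.ones(n))          # arbitrary distribution q on the n outcomes

def A(lam):
    """Log-partition function A(lam) = log sum_x exp(lam . T(x))."""
    return np.log(np.exp(T @ lam).sum())

def p(lam):
    """Family member p_lam as a probability vector."""
    return np.exp(T @ lam - A(lam))

def kl(a, b):
    """KL(a || b) for probability vectors with full support."""
    return float(np.sum(a * np.log(a / b)))

mu_q = q @ T                           # moment of q: E_q[T(x)]

lhs = kl(q, p(lam2)) - kl(q, p(lam1))
rhs = A(lam2) - A(lam1) - (lam2 - lam1) @ mu_q
assert np.isclose(lhs, rhs)            # the KL-difference identity holds
```

The assertion passes because $\mathrm{KL}(q \| p_\lambda) = -H(q) - \langle \lambda, \mu_q \rangle + A(\lambda)$, so the entropy term $-H(q)$ cancels in the difference.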