Function-space regularized Rényi divergences - 专知论文

会员服务 ·

0

散度 · 正则化项 · 泛函 · 方差 · 易处理的 ·

2023 年 2 月 14 日

Function-space regularized Rényi divergences

翻译：函数空间正则化的Rényi散度

Jeremiah Birrell,Yannis Pantazis,Paul Dupuis,Markos A. Katsoulakis,Luc Rey-Bellet

from arxiv, 24 pages, 4 figures

We propose a new family of regularized R\'enyi divergences parametrized not only by the order $\alpha$ but also by a variational function space. These new objects are defined by taking the infimal convolution of the standard R\'enyi divergence with the integral probability metric (IPM) associated with the chosen function space. We derive a novel dual variational representation that can be used to construct numerically tractable divergence estimators. This representation avoids risk-sensitive terms and therefore exhibits lower variance, making it well-behaved when $\alpha>1$; this addresses a notable weakness of prior approaches. We prove several properties of these new divergences, showing that they interpolate between the classical R\'enyi divergences and IPMs. We also study the $\alpha\to\infty$ limit, which leads to a regularized worst-case-regret and a new variational representation in the classical case. Moreover, we show that the proposed regularized R\'enyi divergences inherit features from IPMs such as the ability to compare distributions that are not absolutely continuous, e.g., empirical measures and distributions with low-dimensional support. We present numerical results on both synthetic and real datasets, showing the utility of these new divergences in both estimation and GAN training applications; in particular, we demonstrate significantly reduced variance and improved training performance.

翻译：我们提出了一类新的正则化Rényi散度族，该类散度不仅由阶数$\alpha$参数化，还由变分函数空间参数化。这些新对象通过将标准Rényi散度与所选函数空间相关联的积分概率度量（IPM）进行下确界卷积而定义。我们推导了一种新颖的对偶变分表示，可用于构造数值上易于处理的散度估计器。该表示避免了风险敏感项，因此表现出更低的方差，并在$\alpha>1$时具有良好的性质；这解决了先前方法的一个显著缺陷。我们证明了这些新散度的若干性质，表明它们在经典Rényi散度和IPM之间进行插值。我们还研究了$\alpha\to\infty$的极限情况，该极限导致正则化的最坏情况遗憾，并在经典情形下提供了新的变分表示。此外，我们表明所提出的正则化Rényi散度继承了IPM的特征，例如能够比较并非绝对连续的分布，例如经验测度和具有低维支撑的分布。我们在合成数据集和真实数据集上呈现了数值结果，展示了这些新散度在估计和生成对抗网络（GAN）训练应用中的实用性；特别地，我们证明了显著降低的方差和提升的训练性能。

0

相关内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AAAI2020 图相关论文集

AAAI2020 图相关论文集

图与推荐

11+阅读 · 2020年7月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

靶向血小板膜糖蛋白GPIbα抑制肿瘤转移的作用与分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

脊髓GRK2/Epac1调控小胶质细胞表型转化在电针缓解慢性痛中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

a-突触核蛋白磷酸化相关激酶polo-like kinases在帕金森病发病机制中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

胶体溶液中介质分子诱导亚稳态单质纳米晶的生长与相变研究

国家自然科学基金

0+阅读 · 2013年12月31日

金属/高介电（HfO2）栅功函数的界面和磁性调制

国家自然科学基金

0+阅读 · 2013年12月31日

隧道结中单分子电致发光的角向分布研究

国家自然科学基金

0+阅读 · 2013年12月31日

高功率单频电泵浦垂直外腔面发射半导体激光器研究

国家自然科学基金

0+阅读 · 2012年12月31日

有机分子半导体的非局域电声子耦合：声子色散与二阶电声子相互作用的影响

国家自然科学基金

0+阅读 · 2012年12月31日

广藿香主要活性成份形成的遗传机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

钙敏感受体在缺氧诱导Aβ36807;量生成中的作用及其分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

Towards Flexibility and Interpretability of Gaussian Process State-Space Model

Arxiv

0+阅读 · 2023年4月6日

Synthetic Sample Selection for Generalized Zero-Shot Learning

Arxiv

0+阅读 · 2023年4月6日

Function Approximation with Randomly Initialized Neural Networks for Approximate Model Reference Adaptive Control

Arxiv

0+阅读 · 2023年4月5日

Robust Forecasting for Robotic Control: A Game-Theoretic Approach

Arxiv

0+阅读 · 2023年4月5日

A Bayesian Collocation Integral Method for Parameter Estimation in Ordinary Differential Equations

Arxiv

0+阅读 · 2023年4月4日

Semiparametric efficient estimation of genetic relatedness with double machine learning

Arxiv

0+阅读 · 2023年4月4日

Distributionally robust mixed-integer programming with Wasserstein metric: on the value of uncertain data

Arxiv

0+阅读 · 2023年4月3日

Action Pick-up in Dynamic Action Space Reinforcement Learning

Arxiv

0+阅读 · 2023年4月3日

Rényi Divergence Deep Mutual Learning

Arxiv

0+阅读 · 2023年4月3日

Universal Private Estimators

Arxiv

0+阅读 · 2023年4月1日

VIP会员

文章信息

相关主题

最新内容

综述 | Weights or Skills?：机器人学习从动作预测权重到自编写技能

综述 | Weights or Skills?：机器人学习从动作预测权重到自编写技能

专知会员服务

0+阅读 · 23分钟前

论文 | Causal Inference with Unstructured Outcomes：面向文本与图像结果的因果推断

论文 | Causal Inference with Unstructured Outcomes：面向文本与图像结果的因果推断

专知会员服务

0+阅读 · 31分钟前

面向2027年及未来的海军情报改革

面向2027年及未来的海军情报改革

专知会员服务

3+阅读 · 8月5日

透视一体化防空：人工智能如何重构从探测到杀伤的靶向全流程

透视一体化防空：人工智能如何重构从探测到杀伤的靶向全流程

专知会员服务

6+阅读 · 8月5日

《多武器毁伤效能评估：解析解与优化瞄准点研究》

《多武器毁伤效能评估：解析解与优化瞄准点研究》

专知会员服务

6+阅读 · 8月5日

《一种面向不确定作战环境的异构无人机协同任务与航路规划随机多目标优化方法》

《一种面向不确定作战环境的异构无人机协同任务与航路规划随机多目标优化方法》

专知会员服务

7+阅读 · 8月5日

《一种基于博弈论的海军平台动态武器分配问题求解方法》

《一种基于博弈论的海军平台动态武器分配问题求解方法》

专知会员服务

5+阅读 · 8月5日

《一种面向武器目标分配的快速可扩展Transformer-指针强化学习框架》

《一种面向武器目标分配的快速可扩展Transformer-指针强化学习框架》

专知会员服务

7+阅读 · 8月5日

ACM MM 2026 | DualG-MRAG：解耦宏观推理与微观匹配的多模态检索增强生成

ACM MM 2026 | DualG-MRAG：解耦宏观推理与微观匹配的多模态检索增强生成

专知会员服务

5+阅读 · 8月5日

综述 | Self-Evolving Coding Agents：自进化编程智能体

综述 | Self-Evolving Coding Agents：自进化编程智能体

专知会员服务

6+阅读 · 8月5日

战火淬炼创新：美军联合战备训练中心探讨现代战场挑战

战火淬炼创新：美军联合战备训练中心探讨现代战场挑战

专知会员服务

5+阅读 · 8月5日

美海军陆战队将三型无人机整合入统一战场网络

美海军陆战队将三型无人机整合入统一战场网络

专知会员服务

3+阅读 · 8月5日

《战术指挥控制要务：构建韧性机动指挥控制网格》美智库报告

《战术指挥控制要务：构建韧性机动指挥控制网格》美智库报告

专知会员服务

5+阅读 · 8月5日

《无人机蜂群：释放人类-蜂群编队的潜能》

《无人机蜂群：释放人类-蜂群编队的潜能》

专知会员服务

6+阅读 · 8月5日

《战略战术化：一项综合性述评》

《战略战术化：一项综合性述评》

专知会员服务

4+阅读 · 8月5日

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

37+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

论文 | Causal Inference with Unstructured Outcomes：面向文本与图像结果的因果推断

透视一体化防空：人工智能如何重构从探测到杀伤的靶向全流程

综述 | Weights or Skills?：机器人学习从动作预测权重到自编写技能

面向2027年及未来的海军情报改革

相关资讯

AAAI2020 图相关论文集

AAAI2020 图相关论文集

图与推荐

11+阅读 · 2020年7月15日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

【论文推荐】最新六篇强化学习相关论文—Sublinear、机器阅读理解、加速强化学习、对抗性奖励学习、人机交互

专知

17+阅读 · 2018年4月28日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Towards Flexibility and Interpretability of Gaussian Process State-Space Model

Arxiv

0+阅读 · 2023年4月6日

Synthetic Sample Selection for Generalized Zero-Shot Learning

Arxiv

0+阅读 · 2023年4月6日

Function Approximation with Randomly Initialized Neural Networks for Approximate Model Reference Adaptive Control

Arxiv

0+阅读 · 2023年4月5日

Robust Forecasting for Robotic Control: A Game-Theoretic Approach

Arxiv

0+阅读 · 2023年4月5日

A Bayesian Collocation Integral Method for Parameter Estimation in Ordinary Differential Equations

Arxiv

0+阅读 · 2023年4月4日

Semiparametric efficient estimation of genetic relatedness with double machine learning

Arxiv

0+阅读 · 2023年4月4日

Distributionally robust mixed-integer programming with Wasserstein metric: on the value of uncertain data

Arxiv

0+阅读 · 2023年4月3日

Action Pick-up in Dynamic Action Space Reinforcement Learning

Arxiv

0+阅读 · 2023年4月3日

Rényi Divergence Deep Mutual Learning

Arxiv

0+阅读 · 2023年4月3日

Universal Private Estimators

Arxiv

0+阅读 · 2023年4月1日

相关基金

靶向血小板膜糖蛋白GPIbα抑制肿瘤转移的作用与分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

脊髓GRK2/Epac1调控小胶质细胞表型转化在电针缓解慢性痛中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

a-突触核蛋白磷酸化相关激酶polo-like kinases在帕金森病发病机制中的作用

国家自然科学基金

0+阅读 · 2013年12月31日

胶体溶液中介质分子诱导亚稳态单质纳米晶的生长与相变研究

国家自然科学基金

0+阅读 · 2013年12月31日

金属/高介电（HfO2）栅功函数的界面和磁性调制

国家自然科学基金

0+阅读 · 2013年12月31日

隧道结中单分子电致发光的角向分布研究

国家自然科学基金

0+阅读 · 2013年12月31日

高功率单频电泵浦垂直外腔面发射半导体激光器研究

国家自然科学基金

0+阅读 · 2012年12月31日

有机分子半导体的非局域电声子耦合：声子色散与二阶电声子相互作用的影响

国家自然科学基金

0+阅读 · 2012年12月31日

广藿香主要活性成份形成的遗传机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

钙敏感受体在缺氧诱导Aβ36807;量生成中的作用及其分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员