A stochastic optimization approach to minimize robust density power-based divergences for general parametric density models

Density power divergence (DPD) [Basu et al. (1998), Biometrika], designed to estimate the underlying distribution of the observations robustly, comprises an integral term of the power of the parametric density models to be estimated. While the explicit form of the integral term can be obtained for some specific densities (such as normal density and exponential density), its computational intractability has prohibited the application of DPD-based estimation to more general parametric densities, over a quarter of a century since the proposal of DPD. This study proposes a stochastic optimization approach to minimize DPD for general parametric density models and explains its adequacy by referring to conventional theories on stochastic optimization. The proposed approach also can be applied to the minimization of another density power-based $\gamma$-divergence with the aid of unnormalized models [Kanamori and Fujisawa (2015), Biometrika].

翻译：密度幂散度（DPD）[Basu等(1998)，Biometrika]旨在稳健地估计观测数据的潜在分布，其包含一个待估计参数密度模型幂次的积分项。尽管对于某些特定密度（如正态密度和指数密度）可以显式得到该积分项，但自DPD提出以来的四分之一世纪多里，其计算上的不可行性阻碍了基于DPD的估计应用于更一般的参数密度。本研究提出一种随机优化方法，用于最小化一般参数密度模型的DPD，并通过参考随机优化的传统理论解释其适用性。借助非归一化模型，所提方法还可用于最小化另一种基于密度幂次的γ-散度[Kanamori and Fujisawa (2015)，Biometrika]。

相关内容

DPD

关注 9

分布式并行数据库（DPD）在所有传统的以及新兴的数据库研究领域中发表论文，包括：数据集成、数据共享、安全和隐私、事务管理、流程和工作流管理、信息提取、查询处理和优化、分析大型数据集的挖掘和可视化、存储、数据碎片，放置和分配复制协议、可靠性、容错、持久性、保留、性能和可伸缩性以及各种通信和传播平台及中间件的使用。官网地址：http://dblp.uni-trier.de/db/journals/dpd/

【CVPR 2022】一个完全无监督的框架，从噪声和部分测量中学习图像，Robust Equivariant Imaging: a fully unsupervised framework for learning to image

专知会员服务

25+阅读 · 2022年3月3日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日