From robust tests to Bayes-like posterior distributions

In the Bayes paradigm and for a given loss function, we propose the construction of a new type of posterior distributions, that extends the classical Bayes one, for estimating the law of an $n$-sample. The loss functions we have in mind are based on the total variation and Hellinger distances as well as some $\mathbb{L}_{j}$-ones. We prove that, with a probability close to one, this new posterior distribution concentrates its mass in a neighbourhood of the law of the data, for the chosen loss function, provided that this law belongs to the support of the prior or, at least, lies close enough to it. We therefore establish that the new posterior distribution enjoys some robustness properties with respect to a possible misspecification of the prior, or more precisely, its support. For the total variation and squared Hellinger losses, we also show that the posterior distribution keeps its concentration properties when the data are only independent, hence not necessarily i.i.d., provided that most of their marginals or the average of these are close enough to some probability distribution around which the prior puts enough mass. The posterior distribution is therefore also stable with respect to the equidistribution assumption. We illustrate these results by several applications. We consider the problems of estimating a location parameter or both the location and the scale of a density in a nonparametric framework. Finally, we also tackle the problem of estimating a density, with the squared Hellinger loss, in a high-dimensional parametric model under some sparsity conditions. The results established in this paper are non-asymptotic and provide, as much as possible, explicit constants.

翻译：在贝叶斯范式下，针对给定的损失函数，我们提出了一种新型后验分布的构建方法，该方法扩展了经典贝叶斯后验分布，用于估计$n$个样本的分布律。我们所考虑的损失函数基于全变差距离、Hellinger距离以及某些$\mathbb{L}_{j}$距离。我们证明，在接近1的概率下，当所选损失函数对应的数据分布律属于先验支撑集或其足够接近时，这种新的后验分布会将其质量集中在数据分布律的邻域内。因此，我们确立了该新型后验分布对先验（更确切地说是其支撑集）可能存在的误设具有稳健性。对于全变差损失和平方Hellinger损失，我们还证明，当数据仅独立（不必独立同分布）时，只要其大部分边缘分布或这些边缘分布的平均值足够接近先验赋予足够质量的某个概率分布，后验分布仍能保持其集中性。因此，该后验分布对等分布假设也具有稳定性。我们通过多个应用实例说明了这些结果：研究非参数框架下位置参数或位置与尺度联合估计问题；最后，在稀疏条件下处理高维参数模型中基于平方Hellinger损失的密度估计问题。本文所建立的结论均为非渐近的，并尽可能给出了显式常数。

相关内容

损失函数（机器学习）

关注 10

损失函数，在AI中亦称呼距离函数，度量函数。此处的距离代表的是抽象性的，代表真实数据与预测数据之间的误差。损失函数（loss function）是用来估量你模型的预测值f(x)与真实值Y的不一致程度，它是一个非负实值函数,通常使用L(Y, f(x))来表示，损失函数越小，模型的鲁棒性就越好。损失函数是经验风险函数的核心部分，也是结构风险函数重要组成部分。

DiffRec: 扩散推荐模型（SIGIR'23）

专知会员服务

48+阅读 · 2023年4月16日

分布外泛化(Out-Of-Distribution Generalization) 综述论文，22页pdf240篇文献

专知会员服务

64+阅读 · 2021年9月2日

【ACL2020】多模态信息抽取，365页ppt

专知会员服务

151+阅读 · 2020年7月6日

【亚马逊-WWW2020】不解析,生成!用于面向任务的语义分析的序列到序列体系结构，Don't Parse, Generate! A Sequence to Sequence Architecture for Task-Oriented Semantic Parsing

专知会员服务

15+阅读 · 2020年2月1日