Finite sample expansions and risk bounds in high-dimensional SLS models

This note extends the results of classical parametric statistics like Fisher and Wilks theorem to modern setups with a high or infinite parameter dimension, limited sample size, and possible model misspecification. We consider a special class of stochastically linear smooth (SLS) models satisfying three major conditions: the stochastic component of the log-likelihood is linear in the model parameter and the expected log-likelihood is a smooth and concave function. For the penalized maximum likelihood estimators (pMLE), we establish three types of results: (1) concentration in a small vicinity of the ``truth''; (2) Fisher and Wilks expansions; (3) risk bounds. In all results, the remainder is given explicitly and can be evaluated in terms of the effective sample size and effective parameter dimension which allows us to identify the so-called \emph{critical parameter dimension}. The results are also dimension and coordinate-free. The obtained finite sample expansions are of special interest because they can be used not only for obtaining the risk bounds but also for inference, studying the asymptotic distribution, analysis of resampling procedures, etc. The main tool for all these expansions is the so-called ``basic lemma'' about linearly perturbed optimization. Despite their generality, all the presented bounds are nearly sharp and the classical asymptotic results can be obtained as simple corollaries. Our results indicate that the use of advanced fourth-order expansions allows to relax the critical dimension condition $ \mathbb{p}^{3} \ll n $ from Spokoiny (2023a) to $ \mathbb{p}^{3/2} \ll n $. Examples for classical models like logistic regression, log-density and precision matrix estimation illustrate the applicability of general results.

翻译：本笔记将经典参数统计中的Fisher和Wilks定理等结果推广到具有高维或无限维参数、有限样本量以及可能存在模型误设的现代设定。我们考虑一类满足三个主要条件的随机线性光滑（SLS）特殊模型：对数似然的随机分量在模型参数中是线性的，且期望对数似然是光滑凹函数。针对惩罚极大似然估计量（pMLE），我们建立了三类结果：（1）在“真实值”小邻域内的集中性；（2）Fisher与Wilks展开；（3）风险界。所有结果中的余项均被显式给出，并可通过有效样本量和有效参数维度进行评估，这使得我们能够识别所谓的**临界参数维度**。这些结果同时具有维度无关性和坐标无关性。所获得的有限样本展开具有特殊意义，因为它们不仅可用于推导风险界，还可用于统计推断、渐近分布研究、重抽样过程分析等。所有这些展开的主要工具是所谓的关于线性扰动优化的“基本引理”。尽管具有一般性，本文给出的所有界近乎尖锐，且经典渐近结果可作为简单推论获得。我们的结果表明，采用高阶四阶展开可将Spokoiny (2023a) 中的临界维度条件 $ \mathbb{p}^{3} \ll n $ 放宽至 $ \mathbb{p}^{3/2} \ll n $。逻辑回归、对数密度估计和精度矩阵估计等经典模型的示例说明了通用结果的适用性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【WSDM2020】超越统计关系：将知识关系整合到多标签音乐风格分类的风格关联中（附pdf）

专知会员服务

18+阅读 · 2019年11月23日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日