This paper presents a novel approach to the pointwise estimation of multivariate density functions on known domains of arbitrary dimension, using nonparametric local polynomial estimators. Our method is highly flexible: it applies both to simple domains, such as open connected sets, and to more complicated domains that are not star-shaped around the point of estimation. This allows us to handle domains with sharp concavities, holes, and local pinches, such as polynomial sectors. In addition, we introduce a data-driven selection rule based on the general ideas of Goldenshluger and Lepski. Our results show that the local polynomial estimators are minimax under an $L^2$ risk across a wide range of H\"older-type functional classes. In the adaptive case, we provide oracle inequalities and explicitly determine the convergence rate of our statistical procedure. Simulations on polynomial sectors show that our oracle estimates outperform those of the most popular alternative method, which is implemented in the sparr package for R. Our statistical procedure is implemented in a readily accessible online R package.