The Plackett--Luce model is a popular approach for rank data analysis, where a utility vector is employed to determine the probability of each outcome based on Luce's choice axiom. In this paper, we investigate the asymptotic theory of utility vector estimation by maximizing different types of likelihood, such as the full-, marginal-, and quasi-likelihood. We provide a rank-matching interpretation for the estimating equations of these estimators and analyze their asymptotic behavior as the number of items being compared tends to infinity. In particular, we establish the uniform consistency of these estimators under conditions characterized by the topology of the underlying comparison graph sequence and demonstrate that the proposed conditions are sharp for common sampling scenarios such as the nonuniform random hypergraph model and the hypergraph stochastic block model; we also obtain the asymptotic normality of these estimators and discuss the trade-off between statistical efficiency and computational complexity for practical uncertainty quantification. Both results allow for nonuniform and inhomogeneous comparison graphs with varying edge sizes and different asymptotic orders of edge probabilities. We verify our theoretical findings by conducting detailed numerical experiments.
翻译:Plackett-Luce模型是一种流行的排序数据分析方法,该方法通过效用向量,并基于Luce选择公理确定每个结果的概率。本文研究了通过最大化不同类型似然(如完全似然、边际似然和拟似然)估计效用向量的渐近理论。我们为这些估计量的估计方程提供了排序匹配解释,并分析其在被比较项目数量趋于无穷大时的渐近行为。具体而言,我们建立了这些估计量在底层比较图序列拓扑刻画条件下的相合性,并证明所提条件对于非均匀随机超图模型和超图随机块模型等常见抽样场景是紧的。我们还推导了这些估计量的渐近正态性,并讨论了实际不确定性量化中统计效率与计算复杂度之间的权衡。两个结果均允许比较图具有非均匀、非同质结构,且边大小可变、边概率具有不同渐近阶。通过详细的数值实验验证了理论发现。