The properties of the generalized Waring distribution defined on the non negative integers are reviewed. Formulas for its moments and its mode are given. A construction as a mixture of negative binomial distributions is also presented. Then we turn to the Petersen model for estimating the population size $N$ in a two-way capture recapture experiment. We construct a Bayesian model for $N$ by combining a Waring prior with the hypergeometric distribution for the number of units caught twice in the experiment. Credible intervals for $N$ are obtained using quantiles of the posterior, a generalized Waring distribution. The standard confidence interval for the population size constructed using the asymptotic variance of Petersen estimator and .5 logit transformed interval are shown to be special cases of the generalized Waring credible interval. The true coverage of this interval is shown to be bigger than or equal to its nominal converage in small populations, regardless of the capture probabilities. In addition, its length is substantially smaller than that of the .5 logit transformed interval. Thus the proposed generalized Waring credible interval appears to be the best way to quantify the uncertainty of the Petersen estimator for populations size.
翻译:回顾了定义在非负整数上的广义Waring分布的性质,给出了其矩和众数的公式,并提出了其作为负二项分布混合的构造方法。随后转向利用Petersen模型估计双向捕获-再捕获实验中的总体规模$N$。通过将Waring先验与实验中被捕获两次的单位数量的超几何分布相结合,构建了关于$N$的贝叶斯模型。基于后验(广义Waring分布)的分位数,得到了$N$的可信区间。研究表明,利用Petersen估计量的渐近方差构造的总体规模标准置信区间以及0.5 logit变换区间,均为广义Waring可信区间的特例。在小型总体中,无论捕获概率如何,该区间的真实覆盖概率始终大于或等于其名义覆盖概率。此外,其区间长度远小于0.5 logit变换区间。因此,所提出的广义Waring可信区间似乎是量化Petersen估计量对总体规模不确定性的最佳方法。