This paper addresses population size estimation under respondent-driven sampling (RDS), a variant of link-tracing sampling that leverages social networks over a number of waves to recruit individuals from hidden populations. The RDS process is mostly controlled by individual participants, who may report on the recruitment proposals, or nominations, that they have received or given. By considering all nominations given or received over a time period, one can create a capture-recapture dataset in which the units are individuals who have received at least one nomination and the capture occasions are either time intervals or recruitment waves, with the goal of estimating the size $N$ of the hidden population. In this paper, we argue that the underlying process that generates the RDS nomination data is that of a capture-recapture experiment. We then propose a methodology for estimating the population size and investigate its performance under departures from classical capture-recapture assumptions.
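To make the data construction concrete, the following minimal sketch shows how nomination records could be turned into capture histories, with recruitment waves as capture occasions. It is an illustration under stated assumptions, not the methodology proposed in the paper: the record layout `(individual_id, occasion)`, the function names, and the toy data are hypothetical, and the estimator shown is the classical two-occasion Chapman (bias-corrected Lincoln-Petersen) estimator, used here as a simple stand-in.

```python
from collections import defaultdict


def capture_histories(nominations, n_occasions):
    """Map each nominated individual to a 0/1 capture history.

    nominations: iterable of (individual_id, occasion) pairs, where
    occasion is an integer in 0..n_occasions-1 (hypothetical layout).
    Repeated nominations within one occasion count as a single capture.
    """
    histories = defaultdict(lambda: [0] * n_occasions)
    for person, occasion in nominations:
        histories[person][occasion] = 1
    return dict(histories)


def chapman_estimate(histories):
    """Chapman's estimator of N from two capture occasions.

    Assumes a closed population and equal capture probabilities,
    i.e. the classical assumptions whose violation the paper studies.
    """
    n1 = sum(h[0] for h in histories.values())  # captured on occasion 0
    n2 = sum(h[1] for h in histories.values())  # captured on occasion 1
    m = sum(1 for h in histories.values() if h[0] and h[1])  # captured on both
    return (n1 + 1) * (n2 + 1) / (m + 1) - 1


# Toy nomination data (fabricated purely for illustration).
nominations = [
    ("a", 0), ("b", 0), ("c", 0), ("d", 0),
    ("b", 1), ("c", 1), ("e", 1), ("f", 1), ("b", 1),
]
histories = capture_histories(nominations, n_occasions=2)
n_hat = chapman_estimate(histories)
```

Only individuals with at least one nomination appear in `histories`, which mirrors the definition of units in the abstract; individuals who were never nominated are exactly the unobserved part of $N$ that the estimator has to account for.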