We consider a Bayesian forecast aggregation model where $n$ experts, after observing private signals about an unknown binary event, report their posterior beliefs about the event to a principal, who then aggregates the reports into a single prediction for the event. The signals of the experts and the outcome of the event follow a joint distribution that is unknown to the principal, but the principal has access to i.i.d. "samples" from the distribution, where each sample is a tuple of the experts' reports (not signals) and the realization of the event. Using these samples, the principal aims to find an $\varepsilon$-approximately optimal aggregator, where optimality is measured in terms of the expected squared distance between the aggregated prediction and the realization of the event. We show that the sample complexity of this problem is at least $\tilde \Omega(m^{n-2} / \varepsilon)$ for arbitrary discrete distributions, where $m$ is the size of each expert's signal space. This sample complexity grows exponentially in the number of experts $n$. But, if the experts' signals are independent conditioned on the realization of the event, then the sample complexity is significantly reduced, to $\tilde O(1 / \varepsilon^2)$, which does not depend on $n$. Our results can be generalized to non-binary events. The proof of our results uses a reduction from the distribution learning problem and reveals the fact that forecast aggregation is almost as difficult as distribution learning.
翻译:我们考虑一个贝叶斯预测聚合模型,其中$n$位专家在观察到关于未知二元事件的私有信号后,向决策者报告其后验信念,决策者随后将这些报告聚合为单一预测。专家的信号与事件结果服从一个对决策者未知的联合分布,但决策者可获取该分布的独立同分布"样本",每个样本包含专家报告(而非信号)与事件实现值的元组。通过利用这些样本,决策者旨在寻找一个$\varepsilon$-近似最优聚合器,其最优性以聚合预测与事件实现值之间的期望平方距离为度量。我们证明:对于任意离散分布,该问题的样本复杂度至少为$\tilde \Omega(m^{n-2} / \varepsilon)$,其中$m$为每位专家的信号空间大小。该样本复杂度随专家数量$n$呈指数增长。然而,若专家信号在事件实现值条件下相互独立,则样本复杂度显著降低至$\tilde O(1 / \varepsilon^2)$,且与$n$无关。我们的结果可推广至非二元事件。证明过程通过归约至分布学习问题,揭示了预测聚合几乎与分布学习具有相同难度的本质。