We study the problem of vertex-weighted online bipartite matching with stochastic rewards where matches may fail with some known probability and the decision maker has to adapt to the sequential realization of these outcomes. Recent works have studied several special cases of this problem and it was known that the (randomized) Perturbed Greedy algorithm due to Aggarwal et al. (SODA, 2011) achieves the best possible competitive ratio guarantee of $(1-1/e)$ in some cases. We give a simple proof of these results by reducing (special cases of) the stochastic rewards problem to the deterministic setting of online bipartite matching (Karp, Vazirani, Vazirani (STOC, 1990)). More broadly, our approach gives conditions under which it suffices to analyze the competitive ratio of an algorithm for the simpler setting of deterministic rewards in order to obtain a competitive ratio guarantee for stochastic rewards. The simplicity of our approach reveals that the Perturbed Greedy algorithm has a competitive ratio of $(1-1/e)$ even in certain settings with correlated rewards, where no results were previously known. Finally, we show that without any special assumptions, the Perturbed Greedy algorithm has a competitive ratio strictly less than $(1-1/e)$ for vertex-weighted online matching with stochastic rewards.
翻译:我们研究了顶点加权在线二分匹配问题,其中匹配可能以已知概率失败,决策者需适应这些结果的顺序实现。Recent work已研究该问题的若干特例,且已知由Aggarwal等人(SODA, 2011)提出的(随机化)扰动贪心算法在某些情况下可实现最优竞争比保证(1-1/e)。我们通过将随机奖励问题的(特例)归约为在线二分匹配的确定性场景(Karp, Vazirani, Vazirani(STOC, 1990)),给出这些结果的一个简洁证明。更广泛地,我们的方法给出了在何种条件下,仅需分析确定性奖励简化场景中算法的竞争比即可获得随机奖励的竞争比保证。我们方法的简洁性表明,即使在先前无已知结果的关联奖励场景中,扰动贪心算法仍具有竞争比(1-1/e)。最后,我们证明在无任何特殊假设下,针对带有随机奖励的顶点加权在线匹配问题,扰动贪心算法的竞争比严格小于(1-1/e)。