Collaborative learning techniques have the potential to enable training machine learning models that are superior to models trained on a single entity's data. However, in many cases, potential participants in such collaborative schemes are competitors on a downstream task, such as firms that each aim to attract customers by providing the best recommendations. This can incentivize dishonest updates that damage other participants' models, potentially undermining the benefits of collaboration. In this work, we formulate a game that models such interactions and study two learning tasks within this framework: single-round mean estimation and multi-round SGD on strongly-convex objectives. For a natural class of player actions, we show that rational clients are incentivized to strongly manipulate their updates, preventing learning. We then propose mechanisms that incentivize honest communication and ensure learning quality comparable to full cooperation. Lastly, we empirically demonstrate the effectiveness of our incentive scheme on a standard non-convex federated learning benchmark. Our work shows that explicitly modeling the incentives and actions of dishonest clients, rather than assuming them malicious, can enable strong robustness guarantees for collaborative learning.
翻译:协作学习技术有潜力训练出优于单一实体数据训练的机器学习模型。然而,在许多情况下,此类协作方案的潜在参与者在下游任务中是竞争者,例如各自通过提供最佳推荐来吸引客户的公司。这可能激励不诚实的更新,损害其他参与者的模型,从而可能削弱协作的优势。在本工作中,我们构建了一个博弈模型来描述此类交互,并在此框架内研究了两种学习任务:单轮均值估计和多轮强凸目标上的随机梯度下降。对于一类自然的参与者行为,我们展示了理性客户有强烈动机操纵其更新,从而阻碍学习。随后,我们提出了激励诚实沟通并确保学习质量与完全协作相当的机制。最后,我们在一个标准的非凸联邦学习基准上实验证明了我们激励方案的有效性。我们的工作表明,明确建模不诚实客户的激励和行为(而非假设其为恶意)可以为协作学习提供强大的鲁棒性保证。