This paper explores decentralized learning in a graph-based setting, where data is distributed across nodes. We investigate a decentralized SGD algorithm that uses a random walk to update a global model based on local data. Our focus is on designing the transition probability matrix to speed up convergence. While importance sampling can enhance centralized learning, its decentralized counterpart, realized via the Metropolis-Hastings (MH) algorithm, can suffer from an entrapment problem in which the random walk becomes stuck at certain nodes, slowing convergence. To address this, we propose the Metropolis-Hastings with L\'evy Jumps (MHLJ) algorithm, which incorporates random perturbations (jumps) to overcome entrapment. We theoretically establish the convergence rate and error gap of MHLJ and validate our findings through numerical experiments.
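The mechanism described above can be sketched in a few lines of Python. This is an illustrative sketch, not the paper's MHLJ algorithm: the graph, the node weights, and the jump rule are hypothetical, and the heavy-tailed L\'evy jump lengths are simplified to a uniform restart over all nodes. The MH acceptance ratio assumes a uniform-neighbor proposal, $q(i \to j) = 1/\deg(i)$.

```python
import random

def mh_walk_with_jumps(neighbors, weights, start, steps, jump_prob=0.1, seed=0):
    """Metropolis-Hastings random walk over a graph whose stationary
    distribution is proportional to `weights` (importance sampling),
    with occasional uniform jumps to escape entrapment at heavy nodes.

    Simplification: true Levy jumps draw heavy-tailed jump lengths;
    here a jump teleports to a uniformly random node instead.
    """
    rng = random.Random(seed)
    nodes = list(neighbors)
    state = start
    visits = {v: 0 for v in nodes}
    for _ in range(steps):
        if rng.random() < jump_prob:
            # Jump step: teleport to a uniformly random node.
            state = rng.choice(nodes)
        else:
            # Propose a uniformly random neighbor of the current node.
            proposal = rng.choice(neighbors[state])
            # MH acceptance ratio for proposal q(i->j) = 1/deg(i):
            # min(1, [w_j * deg(i)] / [w_i * deg(j)]).
            ratio = (weights[proposal] * len(neighbors[state])) / (
                weights[state] * len(neighbors[proposal]))
            if rng.random() < min(1.0, ratio):
                state = proposal
        visits[state] += 1
        # In decentralized SGD, node `state` would now take a local
        # gradient step on the shared model before passing it along.
    return visits

# Toy example: a 4-node ring where node 0 has a much larger importance
# weight, so a plain MH walk tends to linger there; the jumps keep the
# walk from getting trapped.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}
w = {0: 100.0, 1: 1.0, 2: 1.0, 3: 1.0}
counts = mh_walk_with_jumps(ring, w, start=0, steps=2000)
```

With `jump_prob=0` the walk concentrates almost entirely on node 0; with a small positive jump probability every node is still visited, which is the entrapment-breaking effect the abstract describes.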