This paper studies decentralized bilevel optimization, in which multiple agents collaborate to solve problems involving nested optimization structures with neighborhood communications. Most existing literature primarily utilizes gradient tracking to mitigate the influence of data heterogeneity, without exploring other well-known heterogeneity-correction techniques such as EXTRA or Exact Diffusion. Additionally, these studies often employ identical decentralized strategies for both upper- and lower-level problems, neglecting to leverage distinct mechanisms across different levels. To address these limitations, this paper proposes SPARKLE, a unified Single-loop Primal-dual AlgoRithm frameworK for decentraLized bilEvel optimization. SPARKLE offers the flexibility to incorporate various heterogeneitycorrection strategies into the algorithm. Moreover, SPARKLE allows for different strategies to solve upper- and lower-level problems. We present a unified convergence analysis for SPARKLE, applicable to all its variants, with state-of-the-art convergence rates compared to existing decentralized bilevel algorithms. Our results further reveal that EXTRA and Exact Diffusion are more suitable for decentralized bilevel optimization, and using mixed strategies in bilevel algorithms brings more benefits than relying solely on gradient tracking.
翻译:本文研究去中心化双层优化问题,其中多个智能体通过邻域通信协作解决涉及嵌套优化结构的问题。现有文献主要利用梯度跟踪来缓解数据异质性的影响,而未探索其他已知的异质性校正技术(如EXTRA或精确扩散)。此外,这些研究通常对上层和下层问题采用相同的去中心化策略,未能利用不同层级间的差异化机制。为克服这些局限性,本文提出SPARKLE——一种用于去中心化双层优化的统一单循环原始对偶算法框架。SPARKLE具备灵活性,可将多种异质性校正策略融入算法。同时,该框架允许采用不同策略分别求解上层和下层问题。我们为SPARKLE提出了统一的收敛性分析,该分析适用于其所有变体,且与现有去中心化双层算法相比具有最优收敛速率。研究结果进一步表明,EXTRA和精确扩散技术更适用于去中心化双层优化,而在双层算法中采用混合策略比单纯依赖梯度跟踪能带来更多优势。