Bilevel optimization enjoys a wide range of applications in hyper-parameter optimization, meta-learning and reinforcement learning. However, bilevel optimization problems are difficult to solve. Recent progress on scalable bilevel algorithms mainly focuses on bilevel optimization problems where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle the bilevel problem through the lens of the penalty method. We show that under certain conditions, the penalty reformulation recovers the solutions of the original bilevel problem. Further, we propose the penalty-based bilevel gradient descent (PBGD) algorithm and establish its finite-time convergence for the constrained bilevel problem without lower-level strong convexity. Experiments showcase the efficiency of the proposed PBGD algorithm.
翻译:双层优化在超参数优化、元学习和强化学习中有着广泛的应用。然而,双层优化问题求解困难。近期可扩展双层算法的进展主要集中于下层目标函数为强凸或无条件约束的双层优化问题。本文从惩罚方法的视角解决双层问题。我们证明,在特定条件下,惩罚重构能够恢复原始双层问题的解。进一步,我们提出基于惩罚的双层梯度下降(PBGD)算法,并证明该算法在下层非强凸约束双层问题上的有限时间收敛性。实验展示了所提PBGD算法的有效性。