Many machine learning tasks can be formulated as a stochastic compositional optimization (SCO) problem such as reinforcement learning, AUC maximization, and meta-learning, where the objective function involves a nested composition associated with an expectation. While a significant amount of studies has been devoted to studying the convergence behavior of SCO algorithms, there is little work on understanding their generalization, i.e., how these learning algorithms built from training examples would behave on future test examples. In this paper, we provide the stability and generalization analysis of stochastic compositional gradient descent algorithms through the lens of algorithmic stability in the framework of statistical learning theory. Firstly, we introduce a stability concept called compositional uniform stability and establish its quantitative relation with generalization for SCO problems. Then, we establish the compositional uniform stability results for two popular stochastic compositional gradient descent algorithms, namely SCGD and SCSC. Finally, we derive dimension-independent excess risk bounds for SCGD and SCSC by trade-offing their stability results and optimization errors. To the best of our knowledge, these are the first-ever-known results on stability and generalization analysis of stochastic compositional gradient descent algorithms.
翻译:许多机器学习任务可被建模为随机复合优化问题,例如强化学习、AUC最大化以及元学习,其中目标函数涉及与期望相关的嵌套复合结构。尽管已有大量研究致力于分析随机复合优化算法的收敛行为,但对其泛化性能的理解——即基于训练样本构建的学习算法在未来测试样本上的表现——仍十分有限。本文通过统计学习理论框架下的算法稳定性视角,对随机复合梯度下降算法的稳定性与泛化性能进行系统分析。首先,我们提出称为复合一致稳定性的新概念,并建立其与SCO问题泛化能力的定量关系。随后,针对两种主流随机复合梯度下降算法SCGD与SCSC,建立了其复合一致稳定性结果。最终,通过平衡稳定性结果与优化误差,推导出SCGD与SCSC的维数无关过余风险上界。据我们所知,这是首次对随机复合梯度下降算法进行稳定性与泛化分析的研究成果。