A Penalty Approach for Differentiation Through Black-Box Quadratic Programming Solvers

Differentiating through the solution of a quadratic program (QP) is a central problem in differentiable optimization. Most existing approaches differentiate through the Karush--Kuhn--Tucker (KKT) system, but their computational cost and numerical robustness can degrade at scale. To address these limitations, we propose dXPP, a penalty-based differentiation framework that decouples QP solving from differentiation. In the solving step (forward pass), dXPP is solver-agnostic and can leverage any black-box QP solver. In the differentiation step (backward pass), we map the solution to a smooth approximate penalty problem and implicitly differentiate through it, requiring only the solution of a much smaller linear system in the primal variables. This approach bypasses the difficulties inherent in explicit KKT differentiation and significantly improves computational efficiency and robustness. We evaluate dXPP on various tasks, including randomly generated QPs, large-scale sparse projection problems, and a real-world multi-period portfolio optimization task. Empirical results demonstrate that dXPP is competitive with KKT-based differentiation methods and achieves substantial speedups on large-scale problems. Our implementation is open source and available at https://github.com/mmmmmmlinghu/dXPP.
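To make the backward pass described above concrete, the following is a minimal sketch on a toy equality-constrained QP. It is not the dXPP implementation: the abstract does not specify the penalty function or how the solver's output is mapped to the penalty problem, so a plain quadratic penalty with weight `rho` is assumed here, inequality constraints are omitted for brevity, and all function names (`penalty_hessian`, `penalty_solution`, `backward_q`, `kkt_solution`) are hypothetical. The sketch only illustrates the general idea that implicit differentiation of a smooth penalty problem needs an n-by-n linear solve in the primal variables, whereas the KKT system has size n + m.

```python
import numpy as np

# Toy QP (illustrative data):  min 0.5 x'Px + q'x  subject to  Ax = b.
P = np.array([[4.0, 1.0, 0.0],
              [1.0, 3.0, 0.0],
              [0.0, 0.0, 2.0]])          # symmetric positive definite
q = np.array([1.0, -2.0, 0.5])
A = np.array([[1.0, 1.0, 1.0]])
b = np.array([1.0])
rho = 1e4                                # assumed penalty weight
c = np.array([1.0, 2.0, 3.0])            # downstream loss L(x) = c'x, so dL/dx = c


def penalty_hessian(P, A, rho):
    """Hessian of the assumed penalty objective
    0.5 x'Px + q'x + (rho/2) ||Ax - b||^2, an n-by-n matrix."""
    return P + rho * A.T @ A


def penalty_solution(P, q, A, b, rho):
    """Minimizer of the penalty objective, from its stationarity condition
    (P + rho A'A) x = rho A'b - q."""
    return np.linalg.solve(penalty_hessian(P, A, rho), rho * A.T @ b - q)


def backward_q(P, A, rho, grad_x):
    """Implicit differentiation of the stationarity condition with respect
    to q gives dx/dq = -H^{-1}, so dL/dq = -H^{-1} dL/dx (H is symmetric).
    Only one n-by-n linear solve is needed."""
    return -np.linalg.solve(penalty_hessian(P, A, rho), grad_x)


def kkt_solution(P, q, A, b):
    """Reference solution from the (n + m)-by-(n + m) KKT system."""
    n, m = P.shape[0], A.shape[0]
    K = np.block([[P, A.T], [A, np.zeros((m, m))]])
    return np.linalg.solve(K, np.concatenate([-q, b]))[:n]


x_pen = penalty_solution(P, q, A, b, rho)
x_kkt = kkt_solution(P, q, A, b)
grad_q = backward_q(P, A, rho, c)

# Central finite-difference check of dL/dq on the penalty problem.
eps = 1e-3
fd = np.zeros_like(q)
for i in range(q.size):
    e = np.zeros_like(q)
    e[i] = eps
    loss_plus = c @ penalty_solution(P, q + e, A, b, rho)
    loss_minus = c @ penalty_solution(P, q - e, A, b, rho)
    fd[i] = (loss_plus - loss_minus) / (2 * eps)

print("penalty vs. KKT solution gap:", np.linalg.norm(x_pen - x_kkt))
print("implicit gradient:", grad_q)
print("finite differences:", fd)
```

In this sketch the forward solve is done in closed form only to keep the example self-contained; in the setting the abstract describes, the solution would come from an arbitrary black-box QP solver. With inequality constraints, a smooth surrogate for the constraint violation would contribute an additional diagonally weighted term to the Hessian, but the specific surrogate is not given in the abstract and is therefore left out.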

