RL-SPH：学习实现整数线性规划的可行解 (RL-SPH: Learning to Achieve Feasible Solutions for Integer Linear Programs) - 专知论文

会员服务 ·

0

可行 · Learning · 整数线性规划 · 线性的 · ILP ·

2025 年 6 月 2 日

RL-SPH: Learning to Achieve Feasible Solutions for Integer Linear Programs

翻译：RL-SPH：学习实现整数线性规划的可行解

Tae-Hoon Lee,Min-Soo Kim

from arxiv, Extended version (19 pages, 7 figures). Accepted at the 2025 AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

Integer linear programming (ILP) is widely utilized for various combinatorial optimization problems. Primal heuristics play a crucial role in quickly finding feasible solutions for NP-hard ILP. Although $\textit{end-to-end learning}$-based primal heuristics (E2EPH) have recently been proposed, they are typically unable to independently generate feasible solutions and mainly focus on binary variables. Ensuring feasibility is critical, especially when handling non-binary integer variables. To address this challenge, we propose RL-SPH, a novel reinforcement learning-based start primal heuristic capable of independently generating feasible solutions, even for ILP involving non-binary integers. Experimental results demonstrate that RL-SPH rapidly obtains high-quality feasible solutions, achieving on average a 44x lower primal gap and a 2.3x lower primal integral compared to existing primal heuristics.

翻译：整数线性规划（ILP）被广泛应用于各类组合优化问题。原始启发式方法在快速求解NP难整数线性规划可行解方面起着关键作用。尽管近期已提出基于端到端学习的原始启发式方法（E2EPH），但它们通常无法独立生成可行解，且主要针对二元变量处理。确保可行性至关重要，尤其是在处理非二元整数变量时。为应对这一挑战，我们提出RL-SPH——一种基于强化学习的新型起始原始启发式方法，能够独立生成可行解，即使对于包含非二元整数的整数线性规划问题亦如此。实验结果表明，相较于现有原始启发式方法，RL-SPH能以更快的速度获得高质量可行解，其原始间隙平均降低44倍，原始积分平均降低2.3倍。

0

相关内容

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

STRCF for Visual Object Tracking

STRCF for Visual Object Tracking

统计学习与视觉计算组

15+阅读 · 2018年5月29日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

城市“建成环境——空间行为”的多尺度影响关系与机理研究

国家自然科学基金

13+阅读 · 2017年12月31日

Musielak-Orlicz-Sobolev 空间中的迹嵌入及其应用

国家自然科学基金

2+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

动态Gr？bner 基与GVW算法

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

Coordinate Condensation: Subspace-Accelerated Coordinate Descent for Physics-Based Simulation

Arxiv

0+阅读 · 2025年10月14日

TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning

Arxiv

0+阅读 · 2025年10月13日

K-ASTRO: Structure-Aware Adaptation of LLMs for Code Vulnerability Detection

Arxiv

0+阅读 · 2025年10月10日

Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation

Arxiv

0+阅读 · 2025年10月10日

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Arxiv

0+阅读 · 2025年10月10日

VIP会员

文章信息

相关主题

整数线性规划

相关VIP内容

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

热门VIP内容

开通专知VIP会员享更多权益服务

《思考蜂群：基础、行为、拓扑与架构、认知、未来之路》400页书籍

【伯克利博士论文】协同语言智能体

新型军备竞赛：美军旨在争夺全球无人机主导地位

《乌克兰的无人机生态系统：经验教训》28页报告

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

STRCF for Visual Object Tracking

STRCF for Visual Object Tracking

统计学习与视觉计算组

15+阅读 · 2018年5月29日

Focal Loss for Dense Object Detection

Focal Loss for Dense Object Detection

统计学习与视觉计算组

12+阅读 · 2018年3月15日

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

IJCAI | Cascade Dynamics Modeling with Attention-based RNN

KingsGarden

13+阅读 · 2017年7月16日

相关论文

Coordinate Condensation: Subspace-Accelerated Coordinate Descent for Physics-Based Simulation

Arxiv

0+阅读 · 2025年10月14日

TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning

Arxiv

0+阅读 · 2025年10月13日

K-ASTRO: Structure-Aware Adaptation of LLMs for Code Vulnerability Detection

Arxiv

0+阅读 · 2025年10月10日

Haystack Engineering: Context Engineering for Heterogeneous and Agentic Long-Context Evaluation

Arxiv

0+阅读 · 2025年10月10日

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Arxiv

0+阅读 · 2025年10月10日

相关基金

城市“建成环境——空间行为”的多尺度影响关系与机理研究

国家自然科学基金

13+阅读 · 2017年12月31日

Musielak-Orlicz-Sobolev 空间中的迹嵌入及其应用

国家自然科学基金

2+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

动态Gr？bner 基与GVW算法

国家自然科学基金

0+阅读 · 2014年12月31日

Poisson流形上的修正Hamilton方法

国家自然科学基金

0+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员