Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs

Deep reinforcement learning (DRL) models have recently shown promising results in solving routing problems. However, most DRL solvers target node routing problems, such as the Traveling Salesman Problem (TSP). There has been limited research on applying neural methods to arc routing problems, such as the Chinese Postman Problem (CPP), since their solution spaces are often more irregular and complex than that of the TSP. To fill this gap, this paper proposes a novel DRL framework to address the CPP with load-dependent costs (CPP-LC) (Corberan et al., 2018), which is a complex arc routing problem with load constraints. The novelty of our method is two-fold. First, we formulate the CPP-LC as a sequential decision problem in the form of a Markov Decision Process (MDP). Second, we introduce an autoregressive DRL model, namely Arc-DRL, consisting of an encoder and a decoder, to solve the CPP-LC effectively. This framework allows the DRL model to scale efficiently to arc routing problems. Furthermore, we propose a new bio-inspired meta-heuristic based on an Evolutionary Algorithm (EA) for the CPP-LC. Extensive experiments show that Arc-DRL outperforms existing meta-heuristic methods, such as the Iterative Local Search (ILS) and Variable Neighborhood Search (VNS) proposed by Corberan et al. (2018), on large benchmark datasets for the CPP-LC in both solution quality and running time. The EA gives the best solution quality, but at the cost of a much longer running time. We release our C++ implementations of the meta-heuristics (EA, ILS and VNS), along with the data generation code and our generated data, at https://github.com/HySonLab/Chinese_Postman_Problem
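
To make "load-dependent costs" concrete, here is a minimal Python sketch of how the cost of a route can depend on the order in which edges are served. The cost model is an assumption made for illustration and is not taken from the abstract: the vehicle leaves fully loaded, each traversal costs the edge length times the total weight carried, and demand served on an edge is unloaded evenly along it. The names `Step`, `load_dependent_cost`, and `vehicle_weight` are hypothetical, and the sketch does not check that the steps form a valid closed tour.

```python
from typing import List, Tuple

# One traversal of an edge: (length, delivered). `delivered` is the demand
# dropped off during this traversal; it is 0.0 when the edge is only crossed.
# Hypothetical representation, not the data format of the released code.
Step = Tuple[float, float]


def load_dependent_cost(steps: List[Step], vehicle_weight: float) -> float:
    """Sum of length * carried weight over all traversals (assumed model)."""
    # Assumption: the vehicle starts with exactly the demand it will deliver.
    load = sum(delivered for _, delivered in steps)
    total = 0.0
    for length, delivered in steps:
        # Assumption: demand is unloaded evenly along the edge, so the
        # average load while serving it is load - delivered / 2.
        total += length * (vehicle_weight + load - delivered / 2.0)
        load -= delivered
    return total


# The same two edges served in opposite orders.
heavy_first = [(2.0, 6.0), (3.0, 2.0)]
light_first = [(3.0, 2.0), (2.0, 6.0)]

cost_heavy_first = load_dependent_cost(heavy_first, vehicle_weight=1.0)
cost_light_first = load_dependent_cost(light_first, vehicle_weight=1.0)
print(cost_heavy_first, cost_light_first)
```

Under these assumptions, serving the heavy edge first is cheaper because less weight is carried over the remaining distance. This order dependence is what makes the problem a natural fit for a sequential, step-by-step formulation.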

