基于改进网络嵌入的深度强化学习求解有限时间范围车辆路径规划问题 (Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding) - 专知论文

会员服务 ·

0

嵌入 · 路径 · 有限时间 · 路径规划 · 网络嵌入 ·

Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding

翻译：基于改进网络嵌入的深度强化学习求解有限时间范围车辆路径规划问题

Ayan Maity,Sudeshna Sarkar

from arxiv, Accepted at AAAI-26 Workshop on AI for Urban Planning

In this paper, we study the vehicle routing problem with a finite time horizon. In this routing problem, the objective is to maximize the number of customer requests served within a finite time horizon. We present a novel routing network embedding module which creates local node embedding vectors and a context-aware global graph representation. The proposed Markov decision process for the vehicle routing problem incorporates the node features, the network adjacency matrix and the edge features as components of the state space. We incorporate the remaining finite time horizon into the network embedding module to provide a proper routing context to the embedding module. We integrate our embedding module with a policy gradient-based deep Reinforcement Learning framework to solve the vehicle routing problem with finite time horizon. We trained and validated our proposed routing method on real-world routing networks, as well as synthetically generated Euclidean networks. Our experimental results show that our method achieves a higher customer service rate than the existing routing methods. Additionally, the solution time of our method is significantly lower than that of the existing methods.

翻译：本文研究有限时间范围内的车辆路径规划问题。在该路径规划问题中，目标是在有限时间范围内最大化已服务的客户请求数量。我们提出了一种新颖的路由网络嵌入模块，该模块可生成局部节点嵌入向量和上下文感知的全局图表示。针对车辆路径规划问题提出的马尔可夫决策过程，将节点特征、网络邻接矩阵和边特征作为状态空间的组成部分。我们将剩余有限时间范围纳入网络嵌入模块，为嵌入模块提供适当的路径规划上下文。我们将所提出的嵌入模块与基于策略梯度的深度强化学习框架相结合，以求解有限时间范围的车辆路径规划问题。我们在真实世界路由网络以及人工生成的欧几里得网络上对所提出的路由方法进行了训练和验证。实验结果表明，与现有路由方法相比，我们的方法实现了更高的客户服务率。此外，我们方法的求解时间显著低于现有方法。

0

相关内容

基于强化学习的无人机自组网路由研究综述

基于强化学习的无人机自组网路由研究综述

专知会员服务

48+阅读 · 2023年9月9日

【阿姆斯特丹博士论文】组合空间的学习与优化:专注于车辆路径的深度学习，172页pdf

【阿姆斯特丹博士论文】组合空间的学习与优化:专注于车辆路径的深度学习，172页pdf

专知会员服务

41+阅读 · 2023年3月20日

基于模型的强化学习综述

基于模型的强化学习综述

专知会员服务

48+阅读 · 2023年1月9日

深度学习在路由问题中的最新进展

深度学习在路由问题中的最新进展

专知会员服务

19+阅读 · 2022年3月6日

如何在交通领域构建基于图的深度学习体系结构:一个综述，How to Build a Graph-Based Deep Learning Architecture in Traffic Domain: A Survey

如何在交通领域构建基于图的深度学习体系结构:一个综述，How to Build a Graph-Based Deep Learning Architecture in Traffic Domain: A Survey

专知会员服务

51+阅读 · 2020年5月26日

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

专知会员服务

34+阅读 · 2019年12月25日

【KDD2019|讲座推荐】深强化学习及其在交通运输中的应用：Deep Reinforcement Learning with Applications in Transportation

【KDD2019|讲座推荐】深强化学习及其在交通运输中的应用：Deep Reinforcement Learning with Applications in Transportation

专知会员服务

57+阅读 · 2019年12月4日

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

专知会员服务

44+阅读 · 2019年11月20日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

【KDD 2019|Tutorial】应用在交通中的强化学习 Deep Reinforcement Learning with Applications in Transportation，滴滴 AI Labs

【KDD 2019|Tutorial】应用在交通中的强化学习 Deep Reinforcement Learning with Applications in Transportation，滴滴 AI Labs

专知会员服务

65+阅读 · 2019年8月8日

推荐！《基于多智能体学习的任务分配动态邻域优化》2022最新41页综述论文，伦敦国王学院

推荐！《基于多智能体学习的任务分配动态邻域优化》2022最新41页综述论文，伦敦国王学院

专知

17+阅读 · 2022年11月15日

《通过近似动态规划解决具有动态目标到达的多Agent路由问题》美国空军大学130页学位论文

《通过近似动态规划解决具有动态目标到达的多Agent路由问题》美国空军大学130页学位论文

专知

15+阅读 · 2022年7月22日

图神经网络如何时序化？看Twitter最新《动态图深度学习:时序图网络TGN》研究，附论文与PPT下载

图神经网络如何时序化？看Twitter最新《动态图深度学习:时序图网络TGN》研究，附论文与PPT下载

专知

17+阅读 · 2021年1月24日

图节点嵌入(Node Embeddings)概述，9页pdf

图节点嵌入(Node Embeddings)概述，9页pdf

专知

15+阅读 · 2020年8月22日

当深度强化学习遇见图神经网络

当深度强化学习遇见图神经网络

专知

227+阅读 · 2019年10月21日

车路协同应用场景分析

车路协同应用场景分析

智能交通技术

24+阅读 · 2019年4月13日

PlaNet 简介：用于强化学习的深度规划网络

PlaNet 简介：用于强化学习的深度规划网络

谷歌开发者

13+阅读 · 2019年3月16日

548页MIT强化学习教程，收藏备用【PDF下载】

548页MIT强化学习教程，收藏备用【PDF下载】

机器学习算法与Python学习

17+阅读 · 2018年10月11日

数据增强：数据有限时如何使用深度学习？（续）

数据增强：数据有限时如何使用深度学习？（续）

AI研习社

14+阅读 · 2018年5月6日

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

专知

14+阅读 · 2018年3月30日

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

6+阅读 · 2015年12月31日

面向车联网的交通网络涌现行为建模

国家自然科学基金

8+阅读 · 2015年12月31日

车联网环境下基于路段负载链估测与优化的动态交通诱导方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

异构车联网协作数据传输关键技术的建模分析及优化算法研究

国家自然科学基金

4+阅读 · 2015年12月31日

面向智能交通的车联网时空数据流异常分析研究

国家自然科学基金

7+阅读 · 2015年12月31日

基于实时路况的乘用车经济环保出行路径规划方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

面向车联网的道路交通事故链动态演变规律及其阻断方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于排队模型的动态车辆路径问题实时优化策略及算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于神经网络和强化学习的车辆装配系统中的多载量小车实时调度方法

国家自然科学基金

4+阅读 · 2014年12月31日

利用复杂网络理論优化车载通信网络

国家自然科学基金

1+阅读 · 2014年12月31日

Spatiotemporal Feature Alignment and Weighted Fusion in Collaborative Perception Enabled by Network Synchronization and Age of Information

Arxiv

0+阅读 · 2月13日

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation

Arxiv

0+阅读 · 2月6日

Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem

Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem

Arxiv

0+阅读 · 2月5日

Fast Non-Episodic Finite-Horizon RL with K-Step Lookahead Thresholding

Arxiv

0+阅读 · 1月31日

Adapting Reinforcement Learning for Path Planning in Constrained Parking Scenarios

Arxiv

0+阅读 · 1月30日

Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration

Arxiv

0+阅读 · 1月29日

Improved Approximations for the Unsplittable Capacitated Vehicle Routing Problem

Arxiv

0+阅读 · 1月29日

Improved Approximations for Dial-a-Ride Problems

Arxiv

0+阅读 · 1月29日

A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem

Arxiv

0+阅读 · 1月21日

Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems

Arxiv

0+阅读 · 1月16日

VIP会员

文章信息

相关主题

相关VIP内容

基于强化学习的无人机自组网路由研究综述

基于强化学习的无人机自组网路由研究综述

专知会员服务

48+阅读 · 2023年9月9日

【阿姆斯特丹博士论文】组合空间的学习与优化:专注于车辆路径的深度学习，172页pdf

【阿姆斯特丹博士论文】组合空间的学习与优化:专注于车辆路径的深度学习，172页pdf

专知会员服务

41+阅读 · 2023年3月20日

基于模型的强化学习综述

基于模型的强化学习综述

专知会员服务

48+阅读 · 2023年1月9日

深度学习在路由问题中的最新进展

深度学习在路由问题中的最新进展

专知会员服务

19+阅读 · 2022年3月6日

如何在交通领域构建基于图的深度学习体系结构:一个综述，How to Build a Graph-Based Deep Learning Architecture in Traffic Domain: A Survey

如何在交通领域构建基于图的深度学习体系结构:一个综述，How to Build a Graph-Based Deep Learning Architecture in Traffic Domain: A Survey

专知会员服务

51+阅读 · 2020年5月26日

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

深度学习在自动车辆控制中的应用研究综述（A Survey of Deep Learning Applications to Autonomous Vehicle Control）

专知会员服务

34+阅读 · 2019年12月25日

【KDD2019|讲座推荐】深强化学习及其在交通运输中的应用：Deep Reinforcement Learning with Applications in Transportation

【KDD2019|讲座推荐】深强化学习及其在交通运输中的应用：Deep Reinforcement Learning with Applications in Transportation

专知会员服务

57+阅读 · 2019年12月4日

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

【WSDM 2020 论文】网络嵌入的初始化：一种图划分方法（Initialization for Network Embedding: A Graph Partition Approach）

专知会员服务

44+阅读 · 2019年11月20日

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

实时强化学习《Real-Time Reinforcement Learning》S Ramstedt, C Pal [Mila, Element AI] (2019)

专知会员服务

13+阅读 · 2019年11月17日

【KDD 2019|Tutorial】应用在交通中的强化学习 Deep Reinforcement Learning with Applications in Transportation，滴滴 AI Labs

【KDD 2019|Tutorial】应用在交通中的强化学习 Deep Reinforcement Learning with Applications in Transportation，滴滴 AI Labs

专知会员服务

65+阅读 · 2019年8月8日

热门VIP内容

开通专知VIP会员享更多权益服务

《可信人工智能赋能系统的支柱》

《从经典神经网络到不确定性下的拓扑神经网络：军事应用》2026最新40页报告

人工智能赋能边缘与自主系统：美陆军现代化进程聚焦威胁探测与战术边缘情报

《人工智能：对战略与力量的影响》slides

相关资讯

推荐！《基于多智能体学习的任务分配动态邻域优化》2022最新41页综述论文，伦敦国王学院

推荐！《基于多智能体学习的任务分配动态邻域优化》2022最新41页综述论文，伦敦国王学院

专知

17+阅读 · 2022年11月15日

《通过近似动态规划解决具有动态目标到达的多Agent路由问题》美国空军大学130页学位论文

《通过近似动态规划解决具有动态目标到达的多Agent路由问题》美国空军大学130页学位论文

专知

15+阅读 · 2022年7月22日

图神经网络如何时序化？看Twitter最新《动态图深度学习:时序图网络TGN》研究，附论文与PPT下载

图神经网络如何时序化？看Twitter最新《动态图深度学习:时序图网络TGN》研究，附论文与PPT下载

专知

17+阅读 · 2021年1月24日

图节点嵌入(Node Embeddings)概述，9页pdf

图节点嵌入(Node Embeddings)概述，9页pdf

专知

15+阅读 · 2020年8月22日

当深度强化学习遇见图神经网络

当深度强化学习遇见图神经网络

专知

227+阅读 · 2019年10月21日

车路协同应用场景分析

车路协同应用场景分析

智能交通技术

24+阅读 · 2019年4月13日

PlaNet 简介：用于强化学习的深度规划网络

PlaNet 简介：用于强化学习的深度规划网络

谷歌开发者

13+阅读 · 2019年3月16日

548页MIT强化学习教程，收藏备用【PDF下载】

548页MIT强化学习教程，收藏备用【PDF下载】

机器学习算法与Python学习

17+阅读 · 2018年10月11日

数据增强：数据有限时如何使用深度学习？（续）

数据增强：数据有限时如何使用深度学习？（续）

AI研习社

14+阅读 · 2018年5月6日

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

【论文推荐】最新八篇网络节点表示相关论文—可扩展嵌入、对抗自编码器、图划分、异构信息、显式矩阵分解、深度高斯、图、随机游走

专知

14+阅读 · 2018年3月30日

相关论文

Spatiotemporal Feature Alignment and Weighted Fusion in Collaborative Perception Enabled by Network Synchronization and Age of Information

Arxiv

0+阅读 · 2月13日

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation

Arxiv

0+阅读 · 2月6日

Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem

Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem

Arxiv

0+阅读 · 2月5日

Fast Non-Episodic Finite-Horizon RL with K-Step Lookahead Thresholding

Arxiv

0+阅读 · 1月31日

Adapting Reinforcement Learning for Path Planning in Constrained Parking Scenarios

Arxiv

0+阅读 · 1月30日

Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration

Arxiv

0+阅读 · 1月29日

Improved Approximations for the Unsplittable Capacitated Vehicle Routing Problem

Arxiv

0+阅读 · 1月29日

Improved Approximations for Dial-a-Ride Problems

Arxiv

0+阅读 · 1月29日

A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem

Arxiv

0+阅读 · 1月21日

Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems

Arxiv

0+阅读 · 1月16日

相关基金

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

6+阅读 · 2015年12月31日

面向车联网的交通网络涌现行为建模

国家自然科学基金

8+阅读 · 2015年12月31日

车联网环境下基于路段负载链估测与优化的动态交通诱导方法研究

国家自然科学基金

1+阅读 · 2015年12月31日

异构车联网协作数据传输关键技术的建模分析及优化算法研究

国家自然科学基金

4+阅读 · 2015年12月31日

面向智能交通的车联网时空数据流异常分析研究

国家自然科学基金

7+阅读 · 2015年12月31日

基于实时路况的乘用车经济环保出行路径规划方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

面向车联网的道路交通事故链动态演变规律及其阻断方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

基于排队模型的动态车辆路径问题实时优化策略及算法研究

国家自然科学基金

1+阅读 · 2014年12月31日

基于神经网络和强化学习的车辆装配系统中的多载量小车实时调度方法

国家自然科学基金

4+阅读 · 2014年12月31日

利用复杂网络理論优化车载通信网络

国家自然科学基金

1+阅读 · 2014年12月31日

微信扫码咨询专知VIP会员