Pointer Network (PtrNet) is a specific neural network for solving Combinatorial Optimization Problems (COPs). While PtrNets offer real-time feed-forward inference for complex COPs instances, its quality of the results tends to be less satisfactory. One possible reason is that such issue suffers from the lack of global search ability of the gradient descent, which is frequently employed in traditional PtrNet training methods including both supervised learning and reinforcement learning. To improve the performance of PtrNet, this paper delves deeply into the advantages of training PtrNet with Evolutionary Algorithms (EAs), which have been widely acknowledged for not easily getting trapped by local optima. Extensive empirical studies based on the Travelling Salesman Problem (TSP) have been conducted. Results demonstrate that PtrNet trained with EA can consistently perform much better inference results than eight state-of-the-art methods on various problem scales. Compared with gradient descent based PtrNet training methods, EA achieves up to 30.21\% improvement in quality of the solution with the same computational time. With this advantage, this paper is able to at the first time report the results of solving 1000-dimensional TSPs by training a PtrNet on the same dimensionality, which strongly suggests that scaling up the training instances is in need to improve the performance of PtrNet on solving higher-dimensional COPs.
翻译:指针网络(PtrNet)是一种专门用于解决组合优化问题(COPs)的神经网络。虽然PtrNet能够对复杂COPs实例进行实时前馈推理,但其结果质量往往不尽如人意。一个可能的原因是,传统PtrNet训练方法(包括监督学习和强化学习)中广泛采用的梯度下降法缺乏全局搜索能力。为提升PtrNet性能,本文深入研究了使用进化算法(EAs)训练PtrNet的优势——众所周知,进化算法不易陷入局部最优解。基于旅行商问题(TSP)的大量实证研究表明,采用EA训练的PtrNet能在不同问题规模上持续产生优于八种最先进方法的推理结果。与基于梯度下降的PtrNet训练方法相比,EA在相同计算时间内可将解的质量提升高达30.21%。凭借这一优势,本文首次实现了通过训练同维PtrNet求解1000维TSP的结果报告,这充分表明扩大训练实例规模是提升PtrNet解决高维COPs性能的必要手段。