Learning latent costs of transitions on graphs from trajectories demonstrations under various contextual features is challenging but useful for path planning. Yet, existing methods either oversimplify cost assumptions or scale poorly with the number of observed trajectories. This paper introduces DataSP, a differentiable all-to-all shortest path algorithm to facilitate learning latent costs from trajectories. It allows to learn from a large number of trajectories in each learning step without additional computation. Complex latent cost functions from contextual features can be represented in the algorithm through a neural network approximation. We further propose a method to sample paths from DataSP in order to reconstruct/mimic observed paths' distributions. We prove that the inferred distribution follows the maximum entropy principle. We show that DataSP outperforms state-of-the-art differentiable combinatorial solver and classical machine learning approaches in predicting paths on graphs.
翻译:从具有各种上下文特征的轨迹演示中学习图上的潜在转移代价具有挑战性,但对路径规划非常有用。然而,现有方法要么过度简化代价假设,要么随着观测轨迹数量的增加而扩展性差。本文提出了DataSP,一种可微的全对最短路径算法,用于从轨迹中学习潜在代价。它允许在每个学习步骤中从大量轨迹中学习,无需额外计算。通过神经网络近似,该算法可以表示来自上下文特征的复杂潜在代价函数。我们进一步提出了一种从DataSP中采样路径的方法,以重建/模仿观测路径的分布。我们证明了推断的分布遵循最大熵原理。我们展示了DataSP在预测图上路径方面优于最先进的可微组合求解器和经典机器学习方法。