Autonomous navigation is challenging for mobile robots, especially in an unknown environment. Commonly, the robot requires multiple sensors to map the environment, locate itself, and make a plan to reach the target. However, reinforcement learning methods offer an alternative to map-free navigation tasks by learning the optimal actions to take. In this article, deep reinforcement learning agents are implemented using variants of the deep Q networks method, the D3QN and rainbow algorithms, for both the obstacle avoidance and the goal-oriented navigation task. The agents are trained and evaluated in a simulated environment. Furthermore, an analysis of the changes in the behaviour and performance of the agents caused by modifications in the reward function is conducted.
翻译:自主导航对于移动机器人而言极具挑战性,尤其是在未知环境中。通常,机器人需要多种传感器来构建环境地图、定位自身并规划到达目标的路径。然而,强化学习方法通过学习最优动作,为无需地图的导航任务提供了替代方案。本文采用深度Q网络方法的变体——D3QN和rainbow算法,实现了深度强化学习智能体,用于避障和目标导向导航任务。智能体在模拟环境中进行训练和评估。此外,本文分析了奖励函数修改对智能体行为与性能变化的影响。