深入加强学习的基于动态的路线计划，以最大程度地减少旅行时间

论文标题

深入加强学习的基于动态的路线计划，以最大程度地减少旅行时间

Deep Reinforcement Learning Based Dynamic Route Planning for Minimizing Travel Time

论文作者

Geng, Yuanzhe, Liu, Erwu, Wang, Rui, Liu, Yiming

论文摘要

路线规划在运输中很重要。现有作品着重于找到最短的路径解决方案或使用诸如安全和能耗之类的指标来确定计划。值得注意的是，这些研究中的大多数都依赖于道路网络的先验知识，这在某些情况下可能无法获得。在本文中，我们设计了一种基于行人深入增强学习（DRL）的路线计划算法。我们将旅行时间消耗用作指标，并通过预测道路网络中的行人流程来计划路线。我们将代理（即智能机器人）放在虚拟地图上。与以前的研究不同，我们的方法假设代理不需要有关道路网络的任何先前信息，而只是依赖于与环境的互动。我们提出了一种动态可调节的路线计划（DARP）算法，在该算法中，代理商通过对决深层Q网络学习策略，以避免拥挤的道路。模拟结果表明，与传统的最短路径计划算法相比，DARP算法在充血条件下节省了52％的时间。

Route planning is important in transportation. Existing works focus on finding the shortest path solution or using metrics such as safety and energy consumption to determine the planning. It is noted that most of these studies rely on prior knowledge of road network, which may be not available in certain situations. In this paper, we design a route planning algorithm based on deep reinforcement learning (DRL) for pedestrians. We use travel time consumption as the metric, and plan the route by predicting pedestrian flow in the road network. We put an agent, which is an intelligent robot, on a virtual map. Different from previous studies, our approach assumes that the agent does not need any prior information about road network, but simply relies on the interaction with the environment. We propose a dynamically adjustable route planning (DARP) algorithm, where the agent learns strategies through a dueling deep Q network to avoid congested roads. Simulation results show that the DARP algorithm saves 52% of the time under congestion condition when compared with traditional shortest path planning algorithms.

下载PDF全文

下载文献需遵守相关版权规定

论文标题