RLPG：用于高速公路上合并的动态内式内部差距适应的增强学习方法

论文标题

RLPG：用于高速公路上合并的动态内式内部差距适应的增强学习方法

RLPG: Reinforcement Learning Approach for Dynamic Intra-Platoon Gap Adaptation for Highway On-Ramp Merging

论文作者

Yadavalli, Sushma Reddy, Das, Lokesh Chandra, Won, Myounggyu

论文摘要

排是指使用自动驾驶技术非常接近的一组车辆。由于其提高燃油效率，驾驶安全性和驾驶员舒适性的巨大能力，因此引起了自动驾驶汽车研究社区的极大关注。尽管非常有利，但最近的研究发现，平板内间隙过多会阻碍高速公路上坡道合并期间的交通流量。尽管现有的基于控制的方法允许适应平台内差距以改善交通流量，但由于巨大的计算复杂性，在交通状况的复杂动态下做出最佳控制决策仍然是一个挑战。在本文中，我们介绍了一种新颖的增强学习框架的设计，实施和评估，该框架可适应地调整单个排成员的平台内差距，以最大程度地提高流量流，以响应高速公路上的跨越坡道合并的动态变化，复杂的交通状况。该框架的状态空间是根据交通文献进行了精心设计的，以考虑到与合并效率直接相关的关键流量参数。创建基于深层确定性策略梯度算法的平台内差距决策方法，以结合连续的动作空间，以确保平台内间隙的精确和连续适应。一项广泛的仿真研究表明，基于强化学习的方法在各种高速公路上合并方案中有效改善交通流量的有效性。

A platoon refers to a group of vehicles traveling together in very close proximity using automated driving technology. Owing to its immense capacity to improve fuel efficiency, driving safety, and driver comfort, platooning technology has garnered substantial attention from the autonomous vehicle research community. Although highly advantageous, recent research has uncovered that an excessively small intra-platoon gap can impede traffic flow during highway on-ramp merging. While existing control-based methods allow for adaptation of the intra-platoon gap to improve traffic flow, making an optimal control decision under the complex dynamics of traffic conditions remains a challenge due to the massive computational complexity. In this paper, we present the design, implementation, and evaluation of a novel reinforcement learning framework that adaptively adjusts the intra-platoon gap of an individual platoon member to maximize traffic flow in response to dynamically changing, complex traffic conditions for highway on-ramp merging. The framework's state space has been meticulously designed in consultation with the transportation literature to take into account critical traffic parameters that bear direct relevance to merging efficiency. An intra-platoon gap decision making method based on the deep deterministic policy gradient algorithm is created to incorporate the continuous action space to ensure precise and continuous adaptation of the intra-platoon gap. An extensive simulation study demonstrates the effectiveness of the reinforcement learning-based approach for significantly improving traffic flow in various highway on-ramp merging scenarios.

下载PDF全文

下载文献需遵守相关版权规定

论文标题