通过增强的多模式增强学习来稳健坡道合并

论文标题

通过增强的多模式增强学习来稳健坡道合并

Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning

论文作者

Bagwe, Gaurav, Li, Jian, Yuan, Xiaoyong, Zhang, Lan

论文摘要

尽管在船上感知方面取得了成功，但在自动驾驶的主要挑战中，越来越多的合并之一。由于车载传感器的传感范围有限，合并车辆几乎无法观察到主要的道路条件并正确合并。通过利用连接车辆和自动化车辆（CAVS）之间的无线通信，合并的CAV有可能主动获得附近车辆的意图。但是，骑士可能容易出现不准确的观察结果，例如嘈杂的基本安全消息（BSM）和质量差监视图像。在本文中，我们通过RAMRL命名的增强和多模式的增强学习，提出了一种新颖的方法，可以通过增强和多模式的增强学习进行新颖的方法。具体来说，我们通过考虑驾驶安全性，舒适的驾驶行为和交通效率来提出坡道合并问题作为马尔可夫决策过程（MDP）。为了提供可靠的合并操作，我们同时利用BSM和监视图像进行多模式观察，该观察用于通过近端策略优化（PPO）学习策略模型。此外，为了提高数据效率并提供更好的概括性能，我们通过增强数据（例如嘈杂的BSM和嘈杂的监视图像）训练政策模型。在两个典型的合并场景下，通过模拟城市流动性（SUMO）平台进行了广泛的实验。实验结果证明了我们强大的坡道合并设计的有效性和效率。

Despite the success of AI-enabled onboard perception, on-ramp merging has been one of the main challenges for autonomous driving. Due to limited sensing range of onboard sensors, a merging vehicle can hardly observe main road conditions and merge properly. By leveraging the wireless communications between connected and automated vehicles (CAVs), a merging CAV has potential to proactively obtain the intentions of nearby vehicles. However, CAVs can be prone to inaccurate observations, such as the noisy basic safety messages (BSM) and poor quality surveillance images. In this paper, we present a novel approach for Robust on-ramp merge of CAVs via Augmented and Multi-modal Reinforcement Learning, named by RAMRL. Specifically, we formulate the on-ramp merging problem as a Markov decision process (MDP) by taking driving safety, comfort driving behavior, and traffic efficiency into account. To provide reliable merging maneuvers, we simultaneously leverage BSM and surveillance images for multi-modal observation, which is used to learn a policy model through proximal policy optimization (PPO). Moreover, to improve data efficiency and provide better generalization performance, we train the policy model with augmented data (e.g., noisy BSM and noisy surveillance images). Extensive experiments are conducted with Simulation of Urban MObility (SUMO) platform under two typical merging scenarios. Experimental results demonstrate the effectiveness and efficiency of our robust on-ramp merging design.

下载PDF全文

下载文献需遵守相关版权规定

论文标题