论文标题

在自动课程中使用深度强化学习,以进行内部学术中的无图导航

Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics

论文作者

Xue, Honghu, Hein, Benedikt, Bakr, Mohamed, Schildbach, Georg, Abel, Bengt, Rueckert, Elmar

论文摘要

我们提出了一种深入的加固学习方法,用于解决仓库场景中的无地图导航问题。在我们的方法中,自动化指导车辆配备了LiDAR和FRORTAL RGB传感器,并学会执行有针对性的导航任务。这些挑战在于学习积极样本的稀疏性,用于学习,多模式传感器感知,部分可观察性,对准确转向操作的需求以及长期训练周期。为了解决这些要点,我们建议NAVACL-Q作为一种自动课程学习的方法,并结合使用软act-Critic算法的分布式版本。在看不见的仓库环境中对学习算法的性能进行了详尽的评估,以验证学习政策的鲁棒性和概括性。 NVIDIA ISAAC SIM的结果表明,我们训练的药物显着优于NVIDIA ISAAC SIM提供的基于地图的导航管道,其代理目标距离增加了3M,更广泛的初始相对代理 - 目标旋转为45度。消融研究还表明,与随机启动的训练相比,NAVACL-Q的性能增长大约40%,大约40%,而预先训练的特征提取器的利用显然可以使性能提高约60%。

We propose a deep reinforcement learning approach for solving a mapless navigation problem in warehouse scenarios. In our approach, an automation guided vehicle is equipped with LiDAR and frontal RGB sensors and learns to perform a targeted navigation task. The challenges reside in the sparseness of positive samples for learning, multi-modal sensor perception with partial observability, the demand for accurate steering maneuvers together with long training cycles. To address these points, we propose NavACL-Q as a method for automatic curriculum learning in combination with a distributed version of the soft actor-critic algorithm. The performance of the learning algorithm is evaluated exhaustively in an unseen warehouse environment to validate both robustness and generalizability of the learned policy. Results in NVIDIA Isaac Sim demonstrates that our trained agent significantly outperforms a map-based navigation pipeline provided by NVIDIA Isaac Sim with an increased agent-goal distance of 3m and wider initial relative agent-goal rotations of 45 degree. The ablation studies also suggests that NavACL-Q greatly facilitates the learning process with a performance gain of roughly 40% compared to training with random starts and that the utilization of a pre-trained feature extractor manifestly boosts the performance by approximately 60%.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源