PPMC RL培训算法：通过增强学习的粗糙地形智能机器人

论文标题

PPMC RL培训算法：通过增强学习的粗糙地形智能机器人

PPMC RL Training Algorithm: Rough Terrain Intelligent Robots through Reinforcement Learning

论文作者

Blum, Tamir, Yoshida, Kazuya

论文摘要

机器人现在可以学习如何做出决策和控制自己，将学习的行为推广到看不见的情况。特别是，由于环境不确定性，AI动力的机器人在月球表面等粗糙环境中显示出希望。我们通过我们创建的名为“路径计划和运动控制（PPMC）训练算法”创建的训练算法来解决机器人运动的这一关键概括方面。该算法与任何通用的增强学习算法结合使用，以教机器人如何响应用户命令并在单个神经网络上旅行到指定位置。在本文中，我们表明该算法独立于机器人结构，表明它在轮毂上起作用，此外还可以在四倍的行走机器人上进行。此外，我们通过引入一个崎rough不平的地形，迈出了几步，朝着现实世界的实用性迈出了几步。至关重要的是，我们通过实验表明，机器人学会将其推广到新的粗糙地形图，并保留100％的成功率。据我们所知，这是第一本引入通用培训算法在粗糙环境中向任何机器人讲通用PPMC的论文，仅使用增强学习。

Robots can now learn how to make decisions and control themselves, generalizing learned behaviors to unseen scenarios. In particular, AI powered robots show promise in rough environments like the lunar surface, due to the environmental uncertainties. We address this critical generalization aspect for robot locomotion in rough terrain through a training algorithm we have created called the Path Planning and Motion Control (PPMC) Training Algorithm. This algorithm is coupled with any generic reinforcement learning algorithm to teach robots how to respond to user commands and to travel to designated locations on a single neural network. In this paper, we show that the algorithm works independent of the robot structure, demonstrating that it works on a wheeled rover in addition the past results on a quadruped walking robot. Further, we take several big steps towards real world practicality by introducing a rough highly uneven terrain. Critically, we show through experiments that the robot learns to generalize to new rough terrain maps, retaining a 100% success rate. To the best of our knowledge, this is the first paper to introduce a generic training algorithm teaching generalized PPMC in rough environments to any robot, with just the use of reinforcement learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题