基于增强学习的人类机器人协作的用户指导运动计划

论文标题

基于增强学习的人类机器人协作的用户指导运动计划

Reinforcement Learning Based User-Guided Motion Planning for Human-Robot Collaboration

论文作者

Yu, Tian, Chang, Qing

论文摘要

机器人擅长执行现代制造业的重复任务。但是，机器人的动作大多是计划和预编程的，显然缺乏对任务更改的适应性。即使对于略有变化的任务，整个系统也必须由机器人专家重新编程。因此，非常希望拥有一种灵活的运动计划方法，该方法可以使用该方法适应非结构化环境（例如生产系统或仓库）中的特定任务变化，而非专家人员很少干预。在本文中，我们提出了一种用户引导的运动计划算法，并结合了增强学习方法（RL）方法，以使机器人通过从几个动力学人类的一些示范中学习，从而自动为新任务生成运动计划。为了实现特定应用程序环境的自适应运动计划，例如，台式组件或仓库加载/卸载，库是通过抽象人类演示的任务的抽象功能来构建的。提出了库中特征和新任务功能之间的语义相似性的定义，并进一步用于在RL中构建奖励功能。如果RL策略确定可以对当前库满足新任务约束并要求其他人类演示，则可以自动为新任务生成运动计划。对常见任务和方案进行的多次实验表明，提出的用户引导的RL辅助运动计划方法有效。

Robots are good at performing repetitive tasks in modern manufacturing industries. However, robot motions are mostly planned and preprogrammed with a notable lack of adaptivity to task changes. Even for slightly changed tasks, the whole system must be reprogrammed by robotics experts. Therefore, it is highly desirable to have a flexible motion planning method, with which robots can adapt to specific task changes in unstructured environments, such as production systems or warehouses, with little or no intervention from non-expert personnel. In this paper, we propose a user-guided motion planning algorithm in combination with the reinforcement learning (RL) method to enable robots automatically generate their motion plans for new tasks by learning from a few kinesthetic human demonstrations. To achieve adaptive motion plans for a specific application environment, e.g., desk assembly or warehouse loading/unloading, a library is built by abstracting features of common human demonstrated tasks. The definition of semantical similarity between features in the library and features of a new task is proposed and further used to construct the reward function in RL. The RL policy can automatically generate motion plans for a new task if it determines that new task constraints can be satisfied with the current library and request additional human demonstrations. Multiple experiments conducted on common tasks and scenarios demonstrate that the proposed user-guided RL-assisted motion planning method is effective.

下载PDF全文

下载文献需遵守相关版权规定

论文标题