论文标题
机器人技术深厚的加固学习中的SIM到现实转移:一项调查
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey
论文作者
论文摘要
深厚的强化学习最近在机器人领域的多个领域看到了巨大的成功。由于收集真实数据的局限性,即样本效率低下和收集它的成本,因此将模拟环境用于培训不同的代理。这不仅有助于提供潜在的无限数据源,而且还可以减轻对真正机器人的安全问题。但是,一旦模型转移到真实的机器人中,模拟世界和现实世界之间的差距就会降低政策的性能。因此,现在有多项研究工作致力于缩小这一SIM到现实的差距,并完成更有效的政策转移。近年来,出现了适用于不同领域的多种方法的出现,但是据我们所知,缺乏全面的评论,总结并将其列入不同方法。在本调查论文中,我们涵盖了深度强化学习中的SIM到现实转移背后的基本背景,并概述目前正在使用的主要方法:域随机化,域名适应性,模仿学习,元学习和知识蒸馏。我们对一些最相关的作品进行了分类,并概述了主要应用程序方案。最后,我们讨论了不同方法的主要机遇和挑战,并指出了最有希望的方向。
Deep reinforcement learning has recently seen huge success across multiple areas in the robotics domain. Owing to the limitations of gathering real-world data, i.e., sample inefficiency and the cost of collecting it, simulation environments are utilized for training the different agents. This not only aids in providing a potentially infinite data source, but also alleviates safety concerns with real robots. Nonetheless, the gap between the simulated and real worlds degrades the performance of the policies once the models are transferred into real robots. Multiple research efforts are therefore now being directed towards closing this sim-to-real gap and accomplish more efficient policy transfer. Recent years have seen the emergence of multiple methods applicable to different domains, but there is a lack, to the best of our knowledge, of a comprehensive review summarizing and putting into context the different methods. In this survey paper, we cover the fundamental background behind sim-to-real transfer in deep reinforcement learning and overview the main methods being utilized at the moment: domain randomization, domain adaptation, imitation learning, meta-learning and knowledge distillation. We categorize some of the most relevant recent works, and outline the main application scenarios. Finally, we discuss the main opportunities and challenges of the different approaches and point to the most promising directions.