学习控制政策，用于预防秋季的安全性和安全性。

论文标题

学习控制政策，用于预防秋季的安全性和安全性。

Learning Control Policies for Fall prevention and safety in bipedal locomotion

论文作者

Kumar, Visak

论文摘要

从意外的外部扰动中恢复的能力是两足球运动的基本运动技能。一个有效的响应包括不仅能够恢复平衡和维持稳定性的能力，而且还可以在平衡恢复在物理上不可行时以安全的方式跌落。对于与两足球运动相关的机器人，例如人形机器人和辅助机器人设备，可帮助人类步行，设计可以提供这种稳定性和安全性的控制器可以防止机器人损坏或防止与伤害相关的医疗费用。这是一项具有挑战性的任务，因为它涉及为具有触点的高维，非线性和射入不足的系统产生高度动态的运动。尽管在使用基于模型和优化的方法方面的先前进步，但诸如对广泛领域知识的要求，相对较大的计算时间和动态变化的鲁棒性的挑战仍然使这是一个开放的问题。在本论文中，为了解决这些问题，我们开发了能够为两种不同类型的机器人综合的基于学习的算法：人形机器人和辅助机器人设备，从而有助于两足动力。我们的工作可以分为两个密切相关的方向：1）学习人形机器人的安全跌倒和跌倒策略； 2）学习使用机器人辅助设备为人类的秋季预防策略。为了实现这一目标，我们介绍了一系列深入增强学习（DRL）算法，以学习在使用这些机器人时提高安全性的控制政策。

The ability to recover from an unexpected external perturbation is a fundamental motor skill in bipedal locomotion. An effective response includes the ability to not just recover balance and maintain stability but also to fall in a safe manner when balance recovery is physically infeasible. For robots associated with bipedal locomotion, such as humanoid robots and assistive robotic devices that aid humans in walking, designing controllers which can provide this stability and safety can prevent damage to robots or prevent injury related medical costs. This is a challenging task because it involves generating highly dynamic motion for a high-dimensional, non-linear and under-actuated system with contacts. Despite prior advancements in using model-based and optimization methods, challenges such as requirement of extensive domain knowledge, relatively large computational time and limited robustness to changes in dynamics still make this an open problem. In this thesis, to address these issues we develop learning-based algorithms capable of synthesizing push recovery control policies for two different kinds of robots : Humanoid robots and assistive robotic devices that assist in bipedal locomotion. Our work can be branched into two closely related directions : 1) Learning safe falling and fall prevention strategies for humanoid robots and 2) Learning fall prevention strategies for humans using a robotic assistive devices. To achieve this, we introduce a set of Deep Reinforcement Learning (DRL) algorithms to learn control policies that improve safety while using these robots.

下载PDF全文

下载文献需遵守相关版权规定

论文标题