论文标题
用于机器人手臂控制
Variational Quantum Soft Actor-Critic for Robotic Arm Control
论文作者
论文摘要
深入的强化学习正在成为机器人手臂运动持续控制任务的一种有希望的方法。但是,学习强大和多功能控制能力的学习挑战远未解决现实世界的应用,这主要是因为该学习范式的两个常见问题:探索策略和缓慢的学习速度,有时被称为“维度诅咒”。这项工作旨在探索和评估量子计算应用于连续控制的最先进的强化学习技术之一的优势 - 即软角色批评。具体而言,通过量子电路的数字模拟,已经研究了差异量子软角色批评者对虚拟机器人臂运动的性能。根据令人满意的模型训练所需参数的数量显着减少,已经发现了比经典算法的量子优势,为进一步的有希望的发展铺平了道路。
Deep Reinforcement Learning is emerging as a promising approach for the continuous control task of robotic arm movement. However, the challenges of learning robust and versatile control capabilities are still far from being resolved for real-world applications, mainly because of two common issues of this learning paradigm: the exploration strategy and the slow learning speed, sometimes known as "the curse of dimensionality". This work aims at exploring and assessing the advantages of the application of Quantum Computing to one of the state-of-art Reinforcement Learning techniques for continuous control - namely Soft Actor-Critic. Specifically, the performance of a Variational Quantum Soft Actor-Critic on the movement of a virtual robotic arm has been investigated by means of digital simulations of quantum circuits. A quantum advantage over the classical algorithm has been found in terms of a significant decrease in the amount of required parameters for satisfactory model training, paving the way for further promising developments.