论文标题
在看不见的孔中学习基于模拟的现实世界钉的视觉政策
Learning A Simulation-based Visual Policy for Real-world Peg In Unseen Holes
论文作者
论文摘要
本文提出了一个基于学习的视觉钉孔,可以在模拟中具有多种形状的培训,并适应现实世界中的任意看不见的形状,其成本最低。核心思想是将感觉运动策略的概括分解为快速适应感知模块和模拟通用策略模块的设计。该框架由分割网络(SN),虚拟传感器网络(VSN)和控制器网络(CN)组成。具体而言,对VSN进行了训练,可以从分段图像中测量看不见的形状的姿势。之后,鉴于形状不足的姿势测量,CN进行了训练以实现通用钉孔。最后,当应用于实际看不见的孔时,我们只需要微调模拟VSN+CN所需的SN即可。为了进一步最大程度地减少转移成本,我们建议在一分钟的人类教学后自动收集和注释SN的数据。模拟和现实世界的结果在眼前的配置下呈现。带有拟议政策内部政策的电动汽车充电系统在2-3s中获得了10/10的成功率,仅使用数百个自动标记的样品进行SN传输。
This paper proposes a learning-based visual peg-in-hole that enables training with several shapes in simulation, and adapting to arbitrary unseen shapes in real world with minimal sim-to-real cost. The core idea is to decouple the generalization of the sensory-motor policy to the design of a fast-adaptable perception module and a simulated generic policy module. The framework consists of a segmentation network (SN), a virtual sensor network (VSN), and a controller network (CN). Concretely, the VSN is trained to measure the pose of the unseen shape from a segmented image. After that, given the shape-agnostic pose measurement, the CN is trained to achieve generic peg-in-hole. Finally, when applying to real unseen holes, we only have to fine-tune the SN required by the simulated VSN+CN. To further minimize the transfer cost, we propose to automatically collect and annotate the data for the SN after one-minute human teaching. Simulated and real-world results are presented under the configurations of eye-to/in-hand. An electric vehicle charging system with the proposed policy inside achieves a 10/10 success rate in 2-3s, using only hundreds of auto-labeled samples for the SN transfer.