Robotslang基准：对话引导机器人本地化和导航

论文标题

Robotslang基准：对话引导机器人本地化和导航

The RobotSlang Benchmark: Dialog-guided Robot Localization and Navigation

论文作者

Banerjee, Shurjo, Thomason, Jesse, Corso, Jason J.

论文摘要

用于搜救到辅助指导的应用程序的自主机器人系统应该能够与人进行自然语言对话。为了研究这种合作的交流，我们介绍了机器人同时本地化和使用自然语言（Robotslang）的映射，这是控制机器人的人类驾驶员与人类指挥官之间的169个自然语言对话的基准，为导航目标提供指导。在每个试验中，两人首先合作将机器人定位在可见的全局地图上，然后驾驶员按照指挥官指令将机器人移动到目标对象的序列。我们介绍了对话框历史记录（LDH）的本地化和对话框历史记录（NDH）任务的导航，在该任务中，从机器人平台中给出了一个学习的代理作为输入的对话框和视觉观察，并且必须分别在全球地图中进行定位或朝下一个目标对象导航。 Robotslang由近5k的话语和超过1k分钟的机器人相机和控制流组成。我们提出了NDH任务的初始模型，并表明在模拟中训练的代理可以遵循基于Robotslang对话框的导航指令，用于控制物理机器人平台。代码和数据可在https://umrobotslang.github.io/上找到。

Autonomous robot systems for applications from search and rescue to assistive guidance should be able to engage in natural language dialog with people. To study such cooperative communication, we introduce Robot Simultaneous Localization and Mapping with Natural Language (RobotSlang), a benchmark of 169 natural language dialogs between a human Driver controlling a robot and a human Commander providing guidance towards navigation goals. In each trial, the pair first cooperates to localize the robot on a global map visible to the Commander, then the Driver follows Commander instructions to move the robot to a sequence of target objects. We introduce a Localization from Dialog History (LDH) and a Navigation from Dialog History (NDH) task where a learned agent is given dialog and visual observations from the robot platform as input and must localize in the global map or navigate towards the next target object, respectively. RobotSlang is comprised of nearly 5k utterances and over 1k minutes of robot camera and control streams. We present an initial model for the NDH task, and show that an agent trained in simulation can follow the RobotSlang dialog-based navigation instructions for controlling a physical robot platform. Code and data are available at https://umrobotslang.github.io/.

下载PDF全文

下载文献需遵守相关版权规定

论文标题