论文标题
部分可观测时空混沌系统的无模型预测
One-Shot Open-Set Skeleton-Based Action Recognition
论文作者
论文摘要
动作识别是人形机器人与人类互动和合作的基本能力。该应用程序需要设计动作识别系统,以便可以轻松添加新操作,同时识别和忽略未知的动作。近年来,深度学习的方法代表了行动识别问题的主要解决方案。但是,大多数模型通常需要大量的手动标记样品数据集。在这项工作中,我们针对一声深度学习的模型,因为它们只能处理课堂的一个实例。不幸的是,一击模型假设在推理时,识别的动作落入了支持集中,并且当动作位于支撑设置之外时,它们就会失败。几乎没有射击开放式识别(FSOSR)解决方案试图解决该缺陷,但是当前的解决方案仅考虑静态图像而不是图像序列。静态图像仍然不足以区分诸如坐下和站立之类的动作。在本文中,我们提出了一个新型模型,该模型通过一个单发模型来解决FSOSR问题,该模型用拒绝未知动作的歧视器增强。该模型对于人体机器人技术中的应用很有用,因为它允许轻松添加新类并确定输入序列是否是系统已知的序列。我们展示了如何以端到端的方式训练整个模型,并进行定量和定性分析。最后,我们提供现实世界中的例子。
Action recognition is a fundamental capability for humanoid robots to interact and cooperate with humans. This application requires the action recognition system to be designed so that new actions can be easily added, while unknown actions are identified and ignored. In recent years, deep-learning approaches represented the principal solution to the Action Recognition problem. However, most models often require a large dataset of manually-labeled samples. In this work we target One-Shot deep-learning models, because they can deal with just a single instance for class. Unfortunately, One-Shot models assume that, at inference time, the action to recognize falls into the support set and they fail when the action lies outside the support set. Few-Shot Open-Set Recognition (FSOSR) solutions attempt to address that flaw, but current solutions consider only static images and not sequences of images. Static images remain insufficient to discriminate actions such as sitting-down and standing-up. In this paper we propose a novel model that addresses the FSOSR problem with a One-Shot model that is augmented with a discriminator that rejects unknown actions. This model is useful for applications in humanoid robotics, because it allows to easily add new classes and determine whether an input sequence is among the ones that are known to the system. We show how to train the whole model in an end-to-end fashion and we perform quantitative and qualitative analyses. Finally, we provide real-world examples.