论文标题
通过自我组成学习发现人类对象互动概念
Discovering Human-Object Interaction Concepts via Self-Compositional Learning
论文作者
论文摘要
对人对象互动(HOI)的全面理解不仅需要检测一小部分预定义的HOI概念(或类别),还需要其他合理的HOI概念,而当前的方法通常无法探索一大批未知的HOI概念(即,动词和对象的合理组合,是不明的,但合理的组合)。在本文中,1)我们介绍了一项新颖而充满挑战的任务,以进行全面的HOI理解,这被称为HOI概念发现; 2)我们为HOI概念发现设计了一个自我复合学习框架(或SCL)。具体来说,我们在培训期间保持了在线更新的概念置信矩阵:1)根据自我训练的概念置信矩阵,我们为所有复合HOI实例分配了伪标记; 2)我们使用所有复合HOI实例的预测更新概念置信矩阵。因此,提出的方法可以对已知和未知的HOI概念进行学习。我们在几个流行的HOI数据集上进行了广泛的实验,以证明提出的HOI概念发现方法,对象负担识别和HOI检测的有效性。例如,提议的自我组成学习框架显着提高了1)HOI概念发现的性能,分别在HICO-DET上和V-Coco的3%以上。 2)在MS-Coco和Hico-Det上识别超过9%的地图对象识别; 3)稀有优先和非稀有的未知HOI检测分别相对30%和20%。代码可在https://github.com/zhihou7/hoi-cl上公开获取。
A comprehensive understanding of human-object interaction (HOI) requires detecting not only a small portion of predefined HOI concepts (or categories) but also other reasonable HOI concepts, while current approaches usually fail to explore a huge portion of unknown HOI concepts (i.e., unknown but reasonable combinations of verbs and objects). In this paper, 1) we introduce a novel and challenging task for a comprehensive HOI understanding, which is termed as HOI Concept Discovery; and 2) we devise a self-compositional learning framework (or SCL) for HOI concept discovery. Specifically, we maintain an online updated concept confidence matrix during training: 1) we assign pseudo-labels for all composite HOI instances according to the concept confidence matrix for self-training; and 2) we update the concept confidence matrix using the predictions of all composite HOI instances. Therefore, the proposed method enables the learning on both known and unknown HOI concepts. We perform extensive experiments on several popular HOI datasets to demonstrate the effectiveness of the proposed method for HOI concept discovery, object affordance recognition and HOI detection. For example, the proposed self-compositional learning framework significantly improves the performance of 1) HOI concept discovery by over 10% on HICO-DET and over 3% on V-COCO, respectively; 2) object affordance recognition by over 9% mAP on MS-COCO and HICO-DET; and 3) rare-first and non-rare-first unknown HOI detection relatively over 30% and 20%, respectively. Code is publicly available at https://github.com/zhihou7/HOI-CL.