论文标题
域mix:学习通用的人重新识别而没有人类注释
DomainMix: Learning Generalizable Person Re-Identification Without Human Annotations
论文作者
论文摘要
现有人员的重新识别模型通常具有较低的概括性,这主要是由于培训中大型标记数据的可用性有限。但是,标记大规模培训数据非常昂贵且耗时,而大规模合成数据集则显示出有希望的学习概括的人重新识别模型的价值。因此,在本文中,提出了一个新颖而实用的人重新识别任务,即。如何使用标记的合成数据集和未标记的现实数据集来训练通用模型。这样,不再需要人类注释,并且可以扩展到大型和多样化的现实世界数据集。为了解决该任务,我们引入了一个具有高推广性的框架,即域mix。具体而言,所提出的方法首先将未标记的现实世界图像簇,并选择可靠的簇。在训练期间,为了解决两个域之间的较大域间隙,提出了一种域不变的特征学习方法,该方法引入了新的损失,即。域平衡损失,在域不变特征学习和域歧视之间进行对抗性学习,同时学习人重新识别的歧视性特征。这样,合成数据和现实世界数据之间的域差距大大降低,并且由于大规模和多样化的培训数据,学到的功能是可以推广的。实验结果表明,所提出的无注释方法或多或少与接受完整人类注释的对应者相提并论,这是非常有前途的。此外,在直接跨数据库评估下,它可以实现几个人重新识别数据集的当前艺术状态。
Existing person re-identification models often have low generalizability, which is mostly due to limited availability of large-scale labeled data in training. However, labeling large-scale training data is very expensive and time-consuming, while large-scale synthetic dataset shows promising value in learning generalizable person re-identification models. Therefore, in this paper a novel and practical person re-identification task is proposed,i.e. how to use labeled synthetic dataset and unlabeled real-world dataset to train a universal model. In this way, human annotations are no longer required, and it is scalable to large and diverse real-world datasets. To address the task, we introduce a framework with high generalizability, namely DomainMix. Specifically, the proposed method firstly clusters the unlabeled real-world images and selects the reliable clusters. During training, to address the large domain gap between two domains, a domain-invariant feature learning method is proposed, which introduces a new loss,i.e. domain balance loss, to conduct an adversarial learning between domain-invariant feature learning and domain discrimination, and meanwhile learns a discriminative feature for person re-identification. This way, the domain gap between synthetic and real-world data is much reduced, and the learned feature is generalizable thanks to the large-scale and diverse training data. Experimental results show that the proposed annotation-free method is more or less comparable to the counterpart trained with full human annotations, which is quite promising. In addition, it achieves the current state of the art on several person re-identification datasets under direct cross-dataset evaluation.