魔鬼在标签中：可靠场景图的嘈杂标签校正

论文标题

魔鬼在标签中：可靠场景图的嘈杂标签校正

The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation

论文作者

Li, Lin, Chen, Long, Huang, Yifeng, Zhang, Zhimeng, Zhang, Songyang, Xiao, Jun

论文摘要

近年来，公正的SGG取得了重大进展。但是，几乎所有现有的SGG模型都忽略了盛行的SGG数据集的基本真相注释质量，即，他们始终假设：1）所有手动注释的阳性样本同样正确； 2）所有未注销的负样本绝对是背景。在本文中，我们认为这两个假设对SGG都是不适用的：有许多“嘈杂”的地面谓词标签破坏了这两个假设，这些嘈杂的样本实际上损害了无偏SGG模型的训练。为此，我们提出了一种新颖的模型敏捷噪声标签校正策略：nice。尼斯不仅可以检测到嘈杂的样本，而且还可以重新分配给它们的高质量谓词标签。经过良好的培训，我们可以获得更清洁的SGG数据集用于模型培训。具体而言，NICE由三个组成部分组成：负噪声样本检测（NEG-NSD），正NSD（POS-NSD）和嘈杂的样品校正（NSC）。首先，在NEG-NSD中，我们将此任务制定为分布外检测问题，并将伪标签分配给所有检测到的嘈杂的负样本。然后，在POS-NSD中，我们使用基于聚类的算法将所有正样品分为多组，并将最嘈杂集中的样品视为嘈杂的阳性样品。最后，在NSC中，我们使用简单但有效的加权KNN将新的谓词标签重新分配给嘈杂的阳性样品。不同的骨干和任务的广泛结果证明了NICE的每个组成部分的有效性和概括能力。

Unbiased SGG has achieved significant progress over recent years. However, almost all existing SGG models have overlooked the ground-truth annotation qualities of prevailing SGG datasets, i.e., they always assume: 1) all the manually annotated positive samples are equally correct; 2) all the un-annotated negative samples are absolutely background. In this paper, we argue that both assumptions are inapplicable to SGG: there are numerous "noisy" groundtruth predicate labels that break these two assumptions, and these noisy samples actually harm the training of unbiased SGG models. To this end, we propose a novel model-agnostic NoIsy label CorrEction strategy for SGG: NICE. NICE can not only detect noisy samples but also reassign more high-quality predicate labels to them. After the NICE training, we can obtain a cleaner version of SGG dataset for model training. Specifically, NICE consists of three components: negative Noisy Sample Detection (Neg-NSD), positive NSD (Pos-NSD), and Noisy Sample Correction (NSC). Firstly, in Neg-NSD, we formulate this task as an out-of-distribution detection problem, and assign pseudo labels to all detected noisy negative samples. Then, in Pos-NSD, we use a clustering-based algorithm to divide all positive samples into multiple sets, and treat the samples in the noisiest set as noisy positive samples. Lastly, in NSC, we use a simple but effective weighted KNN to reassign new predicate labels to noisy positive samples. Extensive results on different backbones and tasks have attested to the effectiveness and generalization abilities of each component of NICE.

下载PDF全文

下载文献需遵守相关版权规定

论文标题