论文标题
通过多标签投影快速在线散列
Fast Online Hashing with Multi-Label Projection
论文作者
论文摘要
由于其时间和存储优势,Hashhing已广泛研究以解决大规模的近似邻居搜索问题。近年来,出现了许多在线哈希方法,可以更新哈希功能以适应新的流数据并实现动态检索。但是,现有的在线哈希方法需要在查询到达时使用最新的哈希功能更新整个数据库,这会导致较低的检索效率,而流数据的持续增加。另一方面,这些方法忽略了示例之间的监督关系,尤其是在多标签情况下。在本文中,我们提出了一种新颖的快速在线哈希(FOH)方法,该方法仅更新数据库的一小部分的二进制代码。具体来说,我们首先建立一个查询池,其中记录了每个中心点的最近邻居。当新查询到达时,仅更新相应潜在邻居的二进制代码。此外,我们创建了一个相似性矩阵,该矩阵将多标签监督信息考虑在内,并引入多标签投影损失,以进一步保留多标签数据之间的相似性。两个共同基准的实验结果表明,所提出的FOH可以在查询时间上比具有竞争力的检索准确性的查询时间高达6.28秒。
Hashing has been widely researched to solve the large-scale approximate nearest neighbor search problem owing to its time and storage superiority. In recent years, a number of online hashing methods have emerged, which can update the hash functions to adapt to the new stream data and realize dynamic retrieval. However, existing online hashing methods are required to update the whole database with the latest hash functions when a query arrives, which leads to low retrieval efficiency with the continuous increase of the stream data. On the other hand, these methods ignore the supervision relationship among the examples, especially in the multi-label case. In this paper, we propose a novel Fast Online Hashing (FOH) method which only updates the binary codes of a small part of the database. To be specific, we first build a query pool in which the nearest neighbors of each central point are recorded. When a new query arrives, only the binary codes of the corresponding potential neighbors are updated. In addition, we create a similarity matrix which takes the multi-label supervision information into account and bring in the multi-label projection loss to further preserve the similarity among the multi-label data. The experimental results on two common benchmarks show that the proposed FOH can achieve dramatic superiority on query time up to 6.28 seconds less than state-of-the-art baselines with competitive retrieval accuracy.