DAS：深度度量学习密集的抽样

论文标题

DAS：深度度量学习密集的抽样

DAS: Densely-Anchored Sampling for Deep Metric Learning

论文作者

Liu, Lizhao, Huang, Shangxin, Zhuang, Zhuangwei, Yang, Ran, Tan, Mingkui, Wang, Yaowei

论文摘要

深度度量学习（DML）有助于学习嵌入功能，以将语义相似的数据投射到附近的嵌入空间中，并在许多应用中起着至关重要的作用，例如图像检索和面部识别。但是，DML方法的性能通常很大程度上取决于采样方法，从训练中的嵌入空间中选择有效的数据。在实践中，嵌入空间中的嵌入是通过某些深层模型获得的，在某些深层模型中，由于没有训练点，嵌入空间通常存在贫瘠的区域，从而导致所谓的“缺失嵌入”问题。此问题可能会损害样品质量，从而导致DML性能退化。在这项工作中，我们研究了如何减轻“缺失”问题以提高采样质量并实现有效的DML。为此，我们提出了一个密集锚定的采样（DAS）方案，该方案将相应的数据点嵌入为“锚”，并利用锚附近的嵌入空间来密集地生成无数据点的嵌入。具体而言，我们建议用歧视性特征缩放（DFS）和记忆转换转换（MTS）的多个锚来利用单个锚周围的嵌入空间。通过这种方式，通过有或没有数据点的嵌入方式，我们能够提供更多的嵌入以促进采样过程，从而提高DML的性能。我们的方法毫不费力地整合到现有的DML框架中，并在没有铃铛和哨子的情况下改进了它们。在三个基准数据集上进行的广泛实验证明了我们方法的优越性。

Deep Metric Learning (DML) serves to learn an embedding function to project semantically similar data into nearby embedding space and plays a vital role in many applications, such as image retrieval and face recognition. However, the performance of DML methods often highly depends on sampling methods to choose effective data from the embedding space in the training. In practice, the embeddings in the embedding space are obtained by some deep models, where the embedding space is often with barren area due to the absence of training points, resulting in so called "missing embedding" issue. This issue may impair the sample quality, which leads to degenerated DML performance. In this work, we investigate how to alleviate the "missing embedding" issue to improve the sampling quality and achieve effective DML. To this end, we propose a Densely-Anchored Sampling (DAS) scheme that considers the embedding with corresponding data point as "anchor" and exploits the anchor's nearby embedding space to densely produce embeddings without data points. Specifically, we propose to exploit the embedding space around single anchor with Discriminative Feature Scaling (DFS) and multiple anchors with Memorized Transformation Shifting (MTS). In this way, by combing the embeddings with and without data points, we are able to provide more embeddings to facilitate the sampling process thus boosting the performance of DML. Our method is effortlessly integrated into existing DML frameworks and improves them without bells and whistles. Extensive experiments on three benchmark datasets demonstrate the superiority of our method.

下载PDF全文

下载文献需遵守相关版权规定

论文标题