通过动态加权正则化学习跨视图的地理位置嵌入嵌入

论文标题

通过动态加权正则化学习跨视图的地理位置嵌入嵌入

Learning Cross-view Geo-localization Embeddings via Dynamic Weighted Decorrelation Regularization

论文作者

Wang, Tingyu, Zheng, Zhedong, Zhu, Zunjie, Gao, Yuhan, Yang, Yi, Yan, Chenggang

论文摘要

跨视图地理定位旨在发现从两个平台（例如无人机平台和卫星平台）拍摄相同位置的图像。现有方法通常着重于优化一个在特征空间中与其他嵌入的距离，同时忽略了嵌入本身的冗余。在本文中，我们认为低冗余也很重要，这激发了模型开采更多样化的模式。为了验证这一点，我们引入了一个简单而有效的正则化，即动态加权去相关（DWDR），以明确鼓励网络学习独立的嵌入渠道。顾名思义，DWDR会将嵌入相关系数矩阵回归到稀疏矩阵，即具有动态权重的身份矩阵。动态权重用于训练期间仍然相关的通道。此外，我们提出了一个跨视图对称抽样策略，该策略使示例保持平衡不同平台之间。尽管很简单，但提出的方法已在三个大规模基准（即大学1652，CVUSA和CVACT）上取得了竞争成果。此外，在恶劣的情况下，例如，64个维度的极短特征，所提出的方法超过了基线模型。

Cross-view geo-localization aims to spot images of the same location shot from two platforms, e.g., the drone platform and the satellite platform. Existing methods usually focus on optimizing the distance between one embedding with others in the feature space, while neglecting the redundancy of the embedding itself. In this paper, we argue that the low redundancy is also of importance, which motivates the model to mine more diverse patterns. To verify this point, we introduce a simple yet effective regularization, i.e., Dynamic Weighted Decorrelation Regularization (DWDR), to explicitly encourage networks to learn independent embedding channels. As the name implies, DWDR regresses the embedding correlation coefficient matrix to a sparse matrix, i.e., the identity matrix, with dynamic weights. The dynamic weights are applied to focus on still correlated channels during training. Besides, we propose a cross-view symmetric sampling strategy, which keeps the example balance between different platforms. Albeit simple, the proposed method has achieved competitive results on three large-scale benchmarks, i.e., University-1652, CVUSA and CVACT. Moreover, under the harsh circumstance, e.g., the extremely short feature of 64 dimensions, the proposed method surpasses the baseline model by a clear margin.

下载PDF全文

下载文献需遵守相关版权规定

论文标题