论文标题
通过几何图形匹配改善基于特征的视觉定位
Improving Feature-based Visual Localization by Geometry-Aided Matching
论文作者
论文摘要
特征匹配对于视觉定位至关重要,其中2D-3D对应关系在确定相机姿势的准确性中起着重要作用。由于噪声而导致的准确姿势估计,足够数量的良好分布的2D-3D对应关系至关重要。但是,现有的2d-3d功能匹配方法依赖于在特征空间中找到最近的邻居,并使用手工制作的启发式方法删除离群值,这可能会导致潜在的匹配或被过滤的正确匹配。在这项工作中,我们提出了一种称为几何辅助匹配(GAM)的新颖方法,该方法同时结合了外观信息和几何环境,以解决此问题并改善2D-3D功能匹配。 GAM可以大大提高2D-3D比赛的召回,同时保持高精度。我们将GAM应用于新的分层视觉定位管道,并表明GAM可以有效地提高本地化的鲁棒性和准确性。广泛的实验表明,与手工制作的启发式和学习基线相比,GAM可以找到更多的真实匹配。我们提出的本地化方法在多个视觉定位数据集上实现了最新的结果。在剑桥地标数据集上进行的实验表明,我们的方法的表现优于现有的最新方法,并且比表现最好的方法快六倍。源代码可在https://github.com/openxrlab/xrlocalization上获得。
Feature matching is crucial in visual localization, where 2D-3D correspondence plays a major role in determining the accuracy of camera pose. A sufficient number of well-distributed 2D-3D correspondences is essential for accurate pose estimation due to noise. However, existing 2D-3D feature matching methods rely on finding nearest neighbors in the feature space and removing outliers using hand-crafted heuristics, which may lead to potential matches being missed or the correct matches being filtered out. In this work, we propose a novel method called Geometry-Aided Matching (GAM), which incorporates both appearance information and geometric context to address this issue and to improve 2D-3D feature matching. GAM can greatly boost the recall of 2D-3D matches while maintaining high precision. We apply GAM to a new hierarchical visual localization pipeline and show that GAM can effectively improve the robustness and accuracy of localization. Extensive experiments show that GAM can find more real matches than hand-crafted heuristics and learning baselines. Our proposed localization method achieves state-of-the-art results on multiple visual localization datasets. Experiments on Cambridge Landmarks dataset show that our method outperforms the existing state-of-the-art methods and is six times faster than the top-performed method. The source code is available at https://github.com/openxrlab/xrlocalization.