论文标题
Perspective-1- ellipsoid:来自一个椭圆形的相应对应关系的相机姿势估计问题的配方,分析和解决方案
Perspective-1-Ellipsoid: Formulation, Analysis and Solutions of the Camera Pose Estimation Problem from One Ellipse-Ellipsoid Correspondence
论文作者
论文摘要
在计算机视觉中,从3D几何实体之间的对应关系及其对图像的投影进行了摄像头姿势估计已被广泛研究。尽管大多数最先进的方法利用了诸如点或线之类的低级原始方法,但近年来非常有效的基于CNN的对象探测器的出现为使用具有有意义信息的高级功能铺平了道路。开拓性朝这个方向起作用,表明通过椭圆形对3D对象进行建模和椭圆检测的2D检测提供了方便的方式来链接2D和3D数据。但是,在相关垃圾中最常使用的数学形式主义不能轻易将椭圆形和椭圆形和其他四肢和圆锥形区分开,从而导致某些发展中可能有害的特异性丧失。此外,投影方程的线性化过程产生了相机参数的过度代表,也可能导致效率损失。因此,在本文中,我们引入了一个特定于椭圆形的理论框架,并在姿势估计的背景下证明了其有益的特性。更确切地说,我们首先表明拟议的形式主义使姿势估计问题可以将姿势估计问题减少到仅在封闭形式中得出其余未知数的位置或方向估计问题。然后,我们证明它可以进一步简化为1个自由度(1DOF)问题,并提供姿势的分析推导,这是该独特标量未知的函数。我们通过视觉示例说明了我们的理论考虑因素,并包括有关实际方面的讨论。最后,我们将本文与相应的源代码一起发布,以便有助于更有效的椭圆形相关姿势估计问题的分辨率。
In computer vision, camera pose estimation from correspondences between 3D geometric entities and their projections into the image has been a widely investigated problem. Although most state-of-the-art methods exploit low-level primitives such as points or lines, the emergence of very effective CNN-based object detectors in the recent years has paved the way to the use of higher-level features carrying semantically meaningful information. Pioneering works in that direction have shown that modelling 3D objects by ellipsoids and 2D detections by ellipses offers a convenient manner to link 2D and 3D data. However, the mathematical formalism most often used in the related litterature does not enable to easily distinguish ellipsoids and ellipses from other quadrics and conics, leading to a loss of specificity potentially detrimental in some developments. Moreover, the linearization process of the projection equation creates an over-representation of the camera parameters, also possibly causing an efficiency loss. In this paper, we therefore introduce an ellipsoid-specific theoretical framework and demonstrate its beneficial properties in the context of pose estimation. More precisely, we first show that the proposed formalism enables to reduce the pose estimation problem to a position or orientation-only estimation problem in which the remaining unknowns can be derived in closed-form. Then, we demonstrate that it can be further reduced to a 1 Degree-of-Freedom (1DoF) problem and provide the analytical derivations of the pose as a function of that unique scalar unknown. We illustrate our theoretical considerations by visual examples and include a discussion on the practical aspects. Finally, we release this paper along with the corresponding source code in order to contribute towards more efficient resolutions of ellipsoid-related pose estimation problems.