论文标题
ProAlignnet:无监督的学习,以逐步对齐嘈杂的轮廓
ProAlignNet : Unsupervised Learning for Progressively Aligning Noisy Contours
论文作者
论文摘要
轮廓形状对齐是计算机视觉中的一个基本但具有挑战性的问题,尤其是当观察结果是部分,嘈杂且在很大程度上被错位时。最新的基于Convnet的架构被提议与形状的轮廓表示,这些结构往往会失败,这主要是由于使用近端不敏感的像素相似性指标作为其训练过程中的损失函数。这项工作提出了一种新颖的Convnet,“ ProAlignnet”,它说明了轮廓形状之间的大规模未对准和复杂的转换。它以多尺度的方式渗透了经线参数,并且在增加的尺度上逐渐增加了复杂的转换。它通过使用新型的损耗功能进行训练,从而学习 - 没有监督 - 与噪声和缺失的部分保持一致,而噪声和丢失的部位是通过新型损耗函数进行训练,该损失功能是邻近敏感性和局部形状依赖性相似性指标的上行,该指标使用经典的形态变化距离距离变换。我们通过一些基本的理智检查实验评估了这些提案在模拟MNIST嘈杂轮廓数据集上的可靠性。接下来,我们证明了在(i)将地理标准数据对齐到航空图像图和(ii)精炼精巧注释的分割标签的两个现实世界应用中提出的模型的有效性。在这两种应用中,提出的模型始终如一地执行优于最新方法。
Contour shape alignment is a fundamental but challenging problem in computer vision, especially when the observations are partial, noisy, and largely misaligned. Recent ConvNet-based architectures that were proposed to align image structures tend to fail with contour representation of shapes, mostly due to the use of proximity-insensitive pixel-wise similarity measures as loss functions in their training processes. This work presents a novel ConvNet, "ProAlignNet" that accounts for large scale misalignments and complex transformations between the contour shapes. It infers the warp parameters in a multi-scale fashion with progressively increasing complex transformations over increasing scales. It learns --without supervision-- to align contours, agnostic to noise and missing parts, by training with a novel loss function which is derived an upperbound of a proximity-sensitive and local shape-dependent similarity metric that uses classical Morphological Chamfer Distance Transform. We evaluate the reliability of these proposals on a simulated MNIST noisy contours dataset via some basic sanity check experiments. Next, we demonstrate the effectiveness of the proposed models in two real-world applications of (i) aligning geo-parcel data to aerial image maps and (ii) refining coarsely annotated segmentation labels. In both applications, the proposed models consistently perform superior to state-of-the-art methods.