NeuralMagiceye：学会看到和理解自动图背后的场景

论文标题

NeuralMagiceye：学会看到和理解自动图背后的场景

NeuralMagicEye: Learning to See and Understand the Scene Behind an Autostereogram

论文作者

Zou, Zhengxia, Shi, Tianyang, Yuan, Yi, Shi, Zhenwei

论文摘要

Autosteregram（又名Magic Eye Image）是一个单像立体图，可以从2D纹理产生3D场景的视觉错觉。本文研究了一个有趣的问题，即是否可以训练深入的CNN来恢复自动图背后的深度并了解其内容。自动图魔术的关键在于立体声 - 为了解决这样的问题，模型必须学会从准周期纹理发现和估计差异。我们表明，嵌入了Disparity卷积的深CNN，这是本文中提出的一种新颖的卷积层，模拟了立体声和编码差异，可以很好地解决此类问题，因为在以自我服务的方式接受了一个大型3D对象数据集对大型3D对象数据集进行了充分的培训。我们将我们的方法称为``neuralmagiceye''。实验表明，我们的方法可以通过丰富的细节和梯度平滑度准确地恢复自动图背后的深度。实验还显示了神经网络和人眼中的自动图感知的完全不同的工作机制。我们希望这项研究能够帮助有视觉障碍的人和难以查看自动图的人。我们的代码可在\ url {https://jiupinjia.github.io/neuralmagiceye/}中获得。

An autostereogram, a.k.a. magic eye image, is a single-image stereogram that can create visual illusions of 3D scenes from 2D textures. This paper studies an interesting question that whether a deep CNN can be trained to recover the depth behind an autostereogram and understand its content. The key to the autostereogram magic lies in the stereopsis - to solve such a problem, a model has to learn to discover and estimate disparity from the quasi-periodic textures. We show that deep CNNs embedded with disparity convolution, a novel convolutional layer proposed in this paper that simulates stereopsis and encodes disparity, can nicely solve such a problem after being sufficiently trained on a large 3D object dataset in a self-supervised fashion. We refer to our method as ``NeuralMagicEye''. Experiments show that our method can accurately recover the depth behind autostereograms with rich details and gradient smoothness. Experiments also show the completely different working mechanisms for autostereogram perception between neural networks and human eyes. We hope this research can help people with visual impairments and those who have trouble viewing autostereograms. Our code is available at \url{https://jiupinjia.github.io/neuralmagiceye/}.

下载PDF全文

下载文献需遵守相关版权规定

论文标题