使用循环一致性的自我监督光场视图合成

论文标题

使用循环一致性的自我监督光场视图合成

Self-supervised Light Field View Synthesis Using Cycle Consistency

论文作者

Chen, Yang, Alain, Martin, Smolic, Aljosa

论文摘要

高角度分辨率对于光场的实际应用是有利的。为了增强光场的角度分辨率，可以利用视图合成方法从稀疏的光场输入中生成密集的中间视图。大多数成功的视图合成方法是基于学习的方法，这些方法需要大量的培训数据与地面真理配对。但是，与自然图像或视频相比，为光场收集如此大的数据集具有挑战性。为了解决这个问题，我们提出了一个以循环一致性为单位的自我监管的光场视图综合框架。所提出的方法旨在将从高质量的自然视频数据集中学到的先验知识转移到光场视图综合任务，从而减少了对标记的光场数据的需求。周期一致性约束用于构建双向映射，从而强制执行与输入视图一致的生成视图。从这个关键概念中得出，两个损失函数，周期损失和重建损失可用于微调最先进的视频插值方法的预训练模型。在各种数据集上评估了所提出的方法，以验证其鲁棒性，结果表明，与监督的微调相比，它不仅可以实现竞争性能，而且表现优于最先进的光场视图合成方法，尤其是在生成多个中间视图时。此外，我们的通用光场视图合成框架可以用于任何预训练的模型，以进行高级视频插值。

High angular resolution is advantageous for practical applications of light fields. In order to enhance the angular resolution of light fields, view synthesis methods can be utilized to generate dense intermediate views from sparse light field input. Most successful view synthesis methods are learning-based approaches which require a large amount of training data paired with ground truth. However, collecting such large datasets for light fields is challenging compared to natural images or videos. To tackle this problem, we propose a self-supervised light field view synthesis framework with cycle consistency. The proposed method aims to transfer prior knowledge learned from high quality natural video datasets to the light field view synthesis task, which reduces the need for labeled light field data. A cycle consistency constraint is used to build bidirectional mapping enforcing the generated views to be consistent with the input views. Derived from this key concept, two loss functions, cycle loss and reconstruction loss, are used to fine-tune the pre-trained model of a state-of-the-art video interpolation method. The proposed method is evaluated on various datasets to validate its robustness, and results show it not only achieves competitive performance compared to supervised fine-tuning, but also outperforms state-of-the-art light field view synthesis methods, especially when generating multiple intermediate views. Besides, our generic light field view synthesis framework can be adopted to any pre-trained model for advanced video interpolation.

下载PDF全文

下载文献需遵守相关版权规定

论文标题