论文标题
基于小波的360度视频的快速解码
Wavelet-Based Fast Decoding of 360-Degree Videos
论文作者
论文摘要
在本文中,我们提出了一个基于小波的视频编解码器,专门为VR显示器设计,可实时播放高分辨率360°视频。我们的编解码器利用了这样一个事实,即随时在显示屏上只能看到完整360°视频框架的一小部分。为了实时加载和解码视频视口,我们利用小波变换进行内部和框架间编码。因此,相关内容是直接从驱动器流中流的,而无需将整个帧保持在内存中。在8192x8192像素全帧分辨率的平均每秒193帧的情况下,进行的评估表明,对于典型的VR显示器,我们的编解码器的解码性能比最先进的视频编解码器H.265和AV1高272%。通过一项感知研究,我们进一步说明了具有更好的VR体验的高帧速率的必要性。最后,我们演示了如何直接将基于小波的编解码器与Foveation结合使用,以进一步提高性能。
In this paper, we propose a wavelet-based video codec specifically designed for VR displays that enables real-time playback of high-resolution 360° videos. Our codec exploits the fact that only a fraction of the full 360° video frame is visible on the display at any time. To load and decode the video viewport-dependently in real time, we make use of the wavelet transform for intra- as well as inter-frame coding. Thereby, the relevant content is directly streamed from the drive, without the need to hold the entire frames in memory. With an average of 193 frames per second at 8192x8192-pixel full-frame resolution, the conducted evaluation demonstrates that our codec's decoding performance is up to 272% higher than that of the state-of-the-art video codecs H.265 and AV1 for typical VR displays. By means of a perceptual study, we further illustrate the necessity of high frame rates for a better VR experience. Finally, we demonstrate how our wavelet-based codec can also directly be used in conjunction with foveation for further performance increase.