论文标题
通过经常学习的自适应视频流进行自适应视频流的预测
Convex Hull Prediction for Adaptive Video Streaming by Recurrent Learning
论文作者
论文摘要
自适应视频流依靠构建高效的比特率梯子来在带宽约束下为观众提供最佳的视觉质量。与内容相关的比特阶梯选择的传统方法需要预先编码多个编码参数,以找到由结果质量曲线的凸面船体给出的最佳操作点。但是,此预编码步骤等于在可能的编码参数的整个空间上的详尽搜索过程,这在计算和时间支出方面都会引起大量开销。为了减少此开销,我们提出了一种基于深度学习的内容的凸面预测的方法。我们采用经常性的卷积网络(RCN)来隐式分析视频拍摄的时空复杂性,以预测其凸壳。采用了两步转移学习方案来培训我们提出的RCN救主模型,该模型确保了足够的内容多样性来分析场景复杂性,同时也可以捕获原始源视频的场景统计信息。我们的实验结果表明,我们提出的模型可以更好地近似最佳的凸壳,并且与现有方法相比,可以节省竞争力的时间。平均而言,我们的方法将预编码时间缩短了53.8%,而预测的凸面壳的平均Bjontegaard Delta Bitrate(BD速率)对地面真理为0.26%,而BD率分布的平均绝对偏差为0.57%。
Adaptive video streaming relies on the construction of efficient bitrate ladders to deliver the best possible visual quality to viewers under bandwidth constraints. The traditional method of content dependent bitrate ladder selection requires a video shot to be pre-encoded with multiple encoding parameters to find the optimal operating points given by the convex hull of the resulting rate-quality curves. However, this pre-encoding step is equivalent to an exhaustive search process over the space of possible encoding parameters, which causes significant overhead in terms of both computation and time expenditure. To reduce this overhead, we propose a deep learning based method of content aware convex hull prediction. We employ a recurrent convolutional network (RCN) to implicitly analyze the spatiotemporal complexity of video shots in order to predict their convex hulls. A two-step transfer learning scheme is adopted to train our proposed RCN-Hull model, which ensures sufficient content diversity to analyze scene complexity, while also making it possible to capture the scene statistics of pristine source videos. Our experimental results reveal that our proposed model yields better approximations of the optimal convex hulls, and offers competitive time savings as compared to existing approaches. On average, the pre-encoding time was reduced by 53.8% by our method, while the average Bjontegaard delta bitrate (BD-rate) of the predicted convex hulls against ground truth was 0.26%, and the mean absolute deviation of the BD-rate distribution was 0.57%.