深层合奏的汇总分布预测

论文标题

深层合奏的汇总分布预测

Aggregating distribution forecasts from deep ensembles

论文作者

Schulz, Benedikt, Köhler, Lutz, Lerch, Sebastian

论文摘要

准确量化预测不确定性的重要性促使人们对概率预测的最新研究。特别是，已经提出了各种深度学习方法，并作为神经网络的输出获得了预测分布。这些基于神经网络的方法通常以合奏的形式使用，例如，基于从不同的随机初始化或更复杂的结合策略（例如辍学）的多个模型运行，从而收集了需要将需要汇总到最终概率预测中的预测分布。为了合并有关集合方法的机器学习文献和有关预测组合的统计文献的发现，我们解决了如何基于此类“深层集合”汇总分布预测的问题。使用理论参数和对十二个基准数据集的全面分析，我们将系统地比较基于概率和分位数的聚合方法，用于三种基于神经网络的方法，具有不同的预测分布类型为输出。我们的结果表明，将深层集合的预测分布相结合可以大大提高预测性能。我们为深层集合提出了一个通用的分位数聚合框架，该框架允许对系统缺陷进行校正并在各种环境中表现良好，通常与预测密度的线性组合相比，通常优越。最后，我们研究了整体规模的效果，并得出了从实践中深层合奏中汇总分布预测的建议。

The importance of accurately quantifying forecast uncertainty has motivated much recent research on probabilistic forecasting. In particular, a variety of deep learning approaches has been proposed, with forecast distributions obtained as output of neural networks. These neural network-based methods are often used in the form of an ensemble, e.g., based on multiple model runs from different random initializations or more sophisticated ensembling strategies such as dropout, resulting in a collection of forecast distributions that need to be aggregated into a final probabilistic prediction. With the aim of consolidating findings from the machine learning literature on ensemble methods and the statistical literature on forecast combination, we address the question of how to aggregate distribution forecasts based on such `deep ensembles'. Using theoretical arguments and a comprehensive analysis on twelve benchmark data sets, we systematically compare probability- and quantile-based aggregation methods for three neural network-based approaches with different forecast distribution types as output. Our results show that combining forecast distributions from deep ensembles can substantially improve the predictive performance. We propose a general quantile aggregation framework for deep ensembles that allows for corrections of systematic deficiencies and performs well in a variety of settings, often superior compared to a linear combination of the forecast densities. Finally, we investigate the effects of the ensemble size and derive recommendations of aggregating distribution forecasts from deep ensembles in practice.

下载PDF全文

下载文献需遵守相关版权规定

论文标题