论文标题
PGCN:用于时空流量预测的渐进图卷积网络
PGCN: Progressive Graph Convolutional Networks for Spatial-Temporal Traffic Forecasting
论文作者
论文摘要
运输网络中复杂的时空相关性使流量预测问题具有挑战性。由于运输系统固有地具有图形结构,因此图形神经网络已经进行了许多研究工作。最近,构造数据的自适应图显示了依靠单个静态图结构的模型的有希望的结果。但是,在训练阶段进行了图适应,并且不能反映在测试阶段使用的数据。这种缺点可能是有问题的,尤其是在流量预测中,因为流量数据通常会遭受时间序列中意外的变化和不规则性。在这项研究中,我们提出了一个新型的流量预测框架,称为渐进图卷积网络(PGCN)。 PGCN通过在培训和测试阶段逐步适应在线输入数据来构建一组图。具体而言,我们通过学习趋势相似性在图节点之间实现了模型来构建渐进式邻接矩阵。然后,将模型与扩张的因果卷积和门控激活单元结合在一起,以提取时间特征。借助残留和跳过连接,PGCN执行流量预测。当应用于不同几何性质的七个现实世界流量数据集时,所提出的模型可以在所有数据集中达到最先进的性能。我们得出的结论是,PGCN逐渐适应输入数据的能力使该模型能够以鲁棒性在不同的研究站点中概括。
The complex spatial-temporal correlations in transportation networks make the traffic forecasting problem challenging. Since transportation system inherently possesses graph structures, many research efforts have been put with graph neural networks. Recently, constructing adaptive graphs to the data has shown promising results over the models relying on a single static graph structure. However, the graph adaptations are applied during the training phases and do not reflect the data used during the testing phases. Such shortcomings can be problematic especially in traffic forecasting since the traffic data often suffer from unexpected changes and irregularities in the time series. In this study, we propose a novel traffic forecasting framework called Progressive Graph Convolutional Network (PGCN). PGCN constructs a set of graphs by progressively adapting to online input data during the training and testing phases. Specifically, we implemented the model to construct progressive adjacency matrices by learning trend similarities among graph nodes. Then, the model is combined with the dilated causal convolution and gated activation unit to extract temporal features. With residual and skip connections, PGCN performs the traffic prediction. When applied to seven real-world traffic datasets of diverse geometric nature, the proposed model achieves state-of-the-art performance with consistency in all datasets. We conclude that the ability of PGCN to progressively adapt to input data enables the model to generalize in different study sites with robustness.