论文标题
sting:基于自我注意的时间序列插补网络使用GAN
STING: Self-attention based Time-series Imputation Networks using GAN
论文作者
论文摘要
时间序列数据在现实世界应用中无处不在。但是,最常见的问题之一是,时间序列数据可能会通过数据收集过程的固有性质丢失值。因此,必须从多元(相关)时间序列数据中推出丢失的值,即在做出准确的数据驱动决策的同时,必须提高预测性能。插补的常规作品只需删除缺失值或根据平均/零填充它们。尽管基于深度神经网络的最新作品显示出了显着的结果,但它们仍然有限制捕获多元时间序列的复杂生成过程。在本文中,我们提出了一种用于多变量时间序列数据的新型插补方法,称为sting(使用GAN的基于自我注意的时间序列插补网络)。我们利用生成的对抗网络和双向复发性神经网络来学习时间序列的潜在表示。此外,我们还引入了一种新型的注意机制,以捕获整个序列的加权相关性,并避免无关的偏差。三个现实世界数据集的实验结果表明,在插补精度以及其下游任务方面,刺痛的表现优于现有的最新方法。
Time series data are ubiquitous in real-world applications. However, one of the most common problems is that the time series data could have missing values by the inherent nature of the data collection process. So imputing missing values from multivariate (correlated) time series data is imperative to improve a prediction performance while making an accurate data-driven decision. Conventional works for imputation simply delete missing values or fill them based on mean/zero. Although recent works based on deep neural networks have shown remarkable results, they still have a limitation to capture the complex generation process of the multivariate time series. In this paper, we propose a novel imputation method for multivariate time series data, called STING (Self-attention based Time-series Imputation Networks using GAN). We take advantage of generative adversarial networks and bidirectional recurrent neural networks to learn latent representations of the time series. In addition, we introduce a novel attention mechanism to capture the weighted correlations of the whole sequence and avoid potential bias brought by unrelated ones. Experimental results on three real-world datasets demonstrate that STING outperforms the existing state-of-the-art methods in terms of imputation accuracy as well as downstream tasks with the imputed values therein.