深层神经网络中的在线时空学习

论文标题

深层神经网络中的在线时空学习

Online Spatio-Temporal Learning in Deep Neural Networks

论文作者

Bohnstingl, Thomas, Woźniak, Stanisław, Maass, Wolfgang, Pantazi, Angeliki, Eleftheriou, Evangelos

论文摘要

生物神经网络具有固有的能力，可以通过在线学习不断适应。这一方面与通过应用于复发性神经网络（RNN）或最近在生物学启发的尖峰神经网络（SNN）的时间（BPTT）的错误背道（BPTT）形成鲜明对比。 BPTT涉及对梯度的离线计算，因为需要通过时间展开网络。最近，在线学习重新获得了研究界的注意力，重点是近似BPTT的方法或应用于SNNS的生物学上可行方案。在这里，我们提出了一种替代观点，该视角基于空间和时间梯度成分的明显分离。结合生物学的见解，我们从第一原则中得出了一种新颖的在线学习算法，用于深入SNN，称为在线时空学习（OSTL）。对于浅网络，OSTL与BPTT首次在线培训SNN具有bptt等效梯度的梯度等效。此外，拟议的配方在低时间复杂性时在线培训一类SNN体系结构。此外，我们将OSTL扩展到通用形式，适用于广泛的网络体系结构，包括包括长期记忆（LSTM）和封闭式复发单元（GRU）的网络。我们演示了算法对从语言建模到语音识别的各种任务的操作，并与BPTT基准相当。拟议的算法为开发SNN和一般Deep RNN的简洁有效的在线培训方法提供了一个框架。

Biological neural networks are equipped with an inherent capability to continuously adapt through online learning. This aspect remains in stark contrast to learning with error backpropagation through time (BPTT) applied to recurrent neural networks (RNNs), or recently to biologically-inspired spiking neural networks (SNNs). BPTT involves offline computation of the gradients due to the requirement to unroll the network through time. Online learning has recently regained the attention of the research community, focusing either on approaches that approximate BPTT or on biologically-plausible schemes applied to SNNs. Here we present an alternative perspective that is based on a clear separation of spatial and temporal gradient components. Combined with insights from biology, we derive from first principles a novel online learning algorithm for deep SNNs, called online spatio-temporal learning (OSTL). For shallow networks, OSTL is gradient-equivalent to BPTT enabling for the first time online training of SNNs with BPTT-equivalent gradients. In addition, the proposed formulation unveils a class of SNN architectures trainable online at low time complexity. Moreover, we extend OSTL to a generic form, applicable to a wide range of network architectures, including networks comprising long short-term memory (LSTM) and gated recurrent units (GRU). We demonstrate the operation of our algorithm on various tasks from language modelling to speech recognition and obtain results on par with the BPTT baselines. The proposed algorithm provides a framework for developing succinct and efficient online training approaches for SNNs and in general deep RNNs.

下载PDF全文

下载文献需遵守相关版权规定

论文标题