DocumentCode :
1927781
Title :
Fast and efficient second-order training of the dynamic neural network paradigm
Author :
Gruber, Christian ; Sick, Bernhard
Author_Institution :
Passau Univ., Germany
Volume :
4
fYear :
2003
fDate :
20-24 July 2003
Firstpage :
2482
Abstract :
In many applications neural networks must process or generate time series, and various network paradigms exist for this purpose. Two prominent examples are time-delay neural networks (TDNN), which are known for their noise suppression capability, and NARX (nonlinear autoregressive models with exogenous inputs) networks, which have a powerful modeling ability (at least Turing equivalence). In this article, we suggest a combination of these two approaches, called dynamic neural network (DYNN), which unifies the particular advantages. Efficient training algorithms are needed to adjust the weights of DYNN. Here, we describe an algorithm for the computation of first-order information about the error surface: temporal backpropagation through time (TBPTT). Essentially, this algorithm is a combination of temporal backpropagation (used for TDNN) and backpropagation through time (used for NARX). The first-order information is then utilized to apply the scaled conjugate gradient (SCG) learning algorithm which approximates second-order with first-order information. The benefits of this approach are shown by means of two benchmark data sets: "logistic map" and "building". It is shown that SCG for DYNN is significantly faster and more accurate than other learning algorithms (e.g. TBPTT, resilient propagation, memoryless Quasi-Newton).
Keywords :
Turing machines; backpropagation; conjugate gradient methods; neural nets; time series; Turing equivalence; dynamic neural network paradigm; noise suppression; nonlinear autoregressive models; scaled conjugate gradient learning algorithm; second-order training; temporal backpropagation through time; time series; time-delay neural networks; Adaptive systems; Application software; Backpropagation algorithms; Computer architecture; Delay; Finite impulse response filter; Logistics; Neural networks; Neurofeedback; Predictive models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2003. Proceedings of the International Joint Conference on
ISSN :
1098-7576
Print_ISBN :
0-7803-7898-9
Type :
conf
DOI :
10.1109/IJCNN.2003.1223954
Filename :
1223954
Link To Document :
بازگشت