Title :
On the choice of parameters of the cost function in nested modular RNN´s
Author :
Mandic, Danilo P. ; Chambers, Jonathon A.
Author_Institution :
Sch. of Inf. Syst., East Anglia Univ., Norwich, UK
fDate :
3/1/2000 12:00:00 AM
Abstract :
We address the choice of the coefficients in the cost function of a modular nested recurrent neural-network (RNN) architecture, known as the pipelined recurrent neural network (PRNN). Such a network can cope with the problem of vanishing gradient, experienced in prediction with RNN´s. Constraints on the coefficients of the cost function, in the form of a vector norm, are considered. Unlike the previous cost function for the PRNN, which included a forgetting factor motivated by the recursive least squares (RLS) strategy, the proposed forms of cost function provide “forgetting” of the outputs of adjacent modules based upon the network architecture. Such an approach takes into account the number of modules in the PRNN, through the unit norm constraint on the coefficients of the cost function of the PRNN. This is shown to be particularly suitable, since due to inherent nesting in the PRNN, every module gives its full contribution to the learning process, whereas the unit norm constrained cost function introduces a sense of forgetting in the memory management of the PRNN. The PRNN based upon a modified cost function outperforms existing PRNN schemes in the time series prediction simulations presented
Keywords :
least squares approximations; neural net architecture; pipeline processing; recurrent neural nets; PRNN; RLS strategy; cost function coefficient constraints; cost function coefficients; cost function parameters; forgetting; forgetting factor; modified cost function; nested modular RNN; pipelined recurrent neural network; recurrent neural-network architecture; recursive least squares strategy; time series prediction simulations; unit norm constrained cost function; vanishing gradient; vector norm; Cost function; Delay effects; Finite impulse response filter; IIR filters; Neural networks; Neurons; Pipeline processing; Predictive models; Recurrent neural networks; Signal processing algorithms;
Journal_Title :
Neural Networks, IEEE Transactions on