Title :
Robust Initialization of a Jordan Network With Recurrent Constrained Learning
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we propose a robust initialization of a Jordan network with a recurrent constrained learning (RIJNRCL) algorithm for multilayered recurrent neural networks (RNNs). This novel algorithm is based on the constrained learning concept of the Jordan network with a recurrent sensitivity and weight convergence analysis, which is used to obtain a tradeoff between the training and testing errors. In addition to using classical techniques for the adaptive learning rate and the adaptive dead zone, RIJNRCL employs a recurrent constrained parameter matrix to switch off excessive contributions from the hidden layer neurons based on weight convergence and stability conditions of the multilayered RNNs. It is well known that a good response from the hidden layer neurons and proper initialization play a dominant role in avoiding local minima in multilayered RNNs. The new RIJNRCL algorithm solves the twin problems of weight initialization and selection of the hidden layer neurons via a novel recurrent sensitivity ratio analysis. We provide the detailed steps for using RIJNRCL in a few benchmark time-series prediction problems and show that the proposed algorithm achieves superior generalization performance.
Keywords :
constraint theory; convergence; learning (artificial intelligence); matrix algebra; prediction theory; recurrent neural nets; time series; Jordan network; adaptive dead zone; adaptive learning rate; hidden layer neurons; multilayered recurrent neural networks; parameter matrix; recurrent constrained learning; time series prediction; weight convergence analysis; weight initialization; Adaptive systems; Algorithm design and analysis; Convergence; Recurrent neural networks; Sensitivity analysis; Stability analysis; Time series analysis; Generalization; real-time rucurrent learning; recurrent sensitivity analysis; robust initialization; time series prediction; weight convergence proof; Algorithms; Computer Simulation; Models, Theoretical; Neural Networks (Computer);
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2011.2168423