DocumentCode :
827709
Title :
Generalized RLS approach to the training of neural networks
Author :
Xu, Yong ; Wong, Kwok-Wo ; Leung, Chi-Sing
Author_Institution :
City Univ. of Hong Kong, China
Volume :
17
Issue :
1
fYear :
2006
Firstpage :
19
Lastpage :
34
Abstract :
Recursive least square (RLS) is an efficient approach to neural network training. However, in the classical RLS algorithm, there is no explicit decay in the energy function. This will lead to an unsatisfactory generalization ability for the trained networks. In this paper, we propose a generalized RLS (GRLS) model which includes a general decay term in the energy function for the training of feedforward neural networks. In particular, four different weight decay functions, namely, the quadratic weight decay, the constant weight decay and the newly proposed multimodal and quartic weight decay are discussed. By using the GRLS approach, not only the generalization ability of the trained networks is significantly improved but more unnecessary weights are pruned to obtain a compact network. Furthermore, the computational complexity of the GRLS remains the same as that of the standard RLS algorithm. The advantages and tradeoffs of using different decay functions are analyzed and then demonstrated with examples. Simulation results show that our approach is able to meet the design goals: improving the generalization ability of the trained network while getting a compact network.
Keywords :
computational complexity; feedforward neural nets; learning (artificial intelligence); least squares approximations; computational complexity; constant weight decay; feedforward neural network; generalized recursive least square; multimodal weight decay; neural network training; quadratic weight decay; quartic weight decay; Artificial neural networks; Backpropagation algorithms; Computational complexity; Computational modeling; Feedforward neural networks; Filtering; Kalman filters; Least squares methods; Neural networks; Resonance light scattering; Extended Kalman filtering (EKF); neural network; recursive least square (RLS) algorithm; weight decay; Algorithms; Artificial Intelligence; Breast Neoplasms; Computer Simulation; Female; Forecasting; Humans; Least-Squares Analysis; Models, Neurological; Neural Networks (Computer); Solar Activity;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2005.860857
Filename :
1593689
Link To Document :
بازگشت