DocumentCode
1442062
Title
Asymptotic distributions associated to Oja´s learning equation for neural networks
Author
Delmas, Jean-Pierre ; Cardos, J.F.
Author_Institution
Inst. Nat. des Telecommun., Evry, France
Volume
9
Issue
6
fYear
1998
fDate
11/1/1998 12:00:00 AM
Firstpage
1246
Lastpage
1257
Abstract
We perform a complete asymptotic performance analysis of the stochastic approximation algorithm (denoted subspace network learning algorithm) derived from Oja´s learning equation, in the case where the learning rate is constant and a large number of patterns is available. This algorithm drives the connection weight matrix W to an orthonormal basis of a dominant invariant subspace of a covariance matrix. Our approach consists in associating to this algorithm a second stochastic approximation algorithm that governs the evolution of WWT to the projection matrix onto this dominant invariant subspace. Then, using a general result of Gaussian approximation theory, we derive the asymptotic distribution of the estimated projection matrix. Closed form expressions of the asymptotic covariance of the projection matrix estimated by the SNL algorithm, and by the smoothed SNL algorithm that we introduce, are given in the case of independent or correlated learning patterns and are further analyzed. It is found that the structures of these asymptotic covariance matrices are similar to those describing batch estimation techniques. The accuracy or our asymptotic analysis is checked by numerical simulations and it is found to be valid not only for a “small” learning rate but in a very large domain. Finally, improvements brought by our smoothed SNL algorithm are shown, such as the learning speed/misadjustment tradeoff and the deviation from orthonormality
Keywords
approximation theory; covariance matrices; learning (artificial intelligence); neural nets; principal component analysis; Gaussian approximation theory; Oja´s learning equation; asymptotic covariance; asymptotic distributions; asymptotic performance analysis; closed form expressions; connection weight matrix; correlated learning patterns; dominant invariant subspace; independent learning patterns; learning rate; learning speed/misadjustment tradeoff; orthonormal basis; orthonormality; projection matrix; small learning rate; smoothed SNL algorithm; stochastic approximation algorithm; subspace network learning algorithm; Adaptive estimation; Algorithm design and analysis; Approximation algorithms; Covariance matrix; Equations; Neural networks; Pattern analysis; Principal component analysis; Signal processing algorithms; Stochastic processes;
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/72.728373
Filename
728373
Link To Document