Asymptotic distributions associated to Oja´s learning equation for neural networks

Author

Delmas, Jean-Pierre ; Cardos, J.F.

Author_Institution

Inst. Nat. des Telecommun., Evry, France

Volume

9

Issue

6

fYear

1998

fDate

11/1/1998 12:00:00 AM

Firstpage

1246

Lastpage

1257

Abstract

We perform a complete asymptotic performance analysis of the stochastic approximation algorithm (denoted subspace network learning algorithm) derived from Oja´s learning equation, in the case where the learning rate is constant and a large number of patterns is available. This algorithm drives the connection weight matrix W to an orthonormal basis of a dominant invariant subspace of a covariance matrix. Our approach consists in associating to this algorithm a second stochastic approximation algorithm that governs the evolution of WW^T to the projection matrix onto this dominant invariant subspace. Then, using a general result of Gaussian approximation theory, we derive the asymptotic distribution of the estimated projection matrix. Closed form expressions of the asymptotic covariance of the projection matrix estimated by the SNL algorithm, and by the smoothed SNL algorithm that we introduce, are given in the case of independent or correlated learning patterns and are further analyzed. It is found that the structures of these asymptotic covariance matrices are similar to those describing batch estimation techniques. The accuracy or our asymptotic analysis is checked by numerical simulations and it is found to be valid not only for a “small” learning rate but in a very large domain. Finally, improvements brought by our smoothed SNL algorithm are shown, such as the learning speed/misadjustment tradeoff and the deviation from orthonormality

Keywords

approximation theory; covariance matrices; learning (artificial intelligence); neural nets; principal component analysis; Gaussian approximation theory; Oja´s learning equation; asymptotic covariance; asymptotic distributions; asymptotic performance analysis; closed form expressions; connection weight matrix; correlated learning patterns; dominant invariant subspace; independent learning patterns; learning rate; learning speed/misadjustment tradeoff; orthonormal basis; orthonormality; projection matrix; small learning rate; smoothed SNL algorithm; stochastic approximation algorithm; subspace network learning algorithm; Adaptive estimation; Algorithm design and analysis; Approximation algorithms; Covariance matrix; Equations; Neural networks; Pattern analysis; Principal component analysis; Signal processing algorithms; Stochastic processes;

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

ieee

ISSN

1045-9227

Type

jour

DOI

10.1109/72.728373

Filename

728373