DocumentCode
3440652
Title
Information geometry of neural learning and belief propagation
Author
Amari, Shun-Ichi
Volume
2
fYear
2002
fDate
18-22 Nov. 2002
Abstract
Summary form only given. Many complex systems such as the brain work sufficiently well under stochastic uncertainty and noise fluctuations. Learning and statistical inference play a key role in such systems. Information geometry emerged from studies of the intrinsic structure of the manifold of probability distributions and is applicable to a wide variety of information systems. Information geometry provides a new tool for elucidating the intrinsic structure of such systems. The article gives an understandable introduction to information geometry. We then show how the geometric concepts are applied to learning algorithms in neural networks. The set of neural networks, for example multilayer perceptrons, forms a manifold, where modifiable parameters are regarded as coordinates. This is called the neuromanifold, which is Riemannian, and in which learning takes place. The dynamics of learning is represented by a trajectory in this Riemannian space. By using this example, we explain why the geometrical consideration is important. We further propose a new efficient learning method called the natural gradient method which outperforms the error backpropagation method. Belief propagation is an important method developed in artificial intelligence, where stochastic inference takes place under graphical models. This is a hot topic connecting artificial intelligence, learning, error correcting codes and stochastic reasoning. Information geometry elucidates the intrinsic structure of the belief propagation method, and unifies the AI approach, information coding, statistical approach, and statistical physical method. Such hot topics are also explained.
Keywords
belief maintenance; geometry; gradient methods; learning (artificial intelligence); neural nets; stochastic processes; Riemannian space; belief propagation; graphical models; information coding; information geometry; information systems; learning algorithms; modifiable parameters; multilayer perceptrons; natural gradient method; neural learning; neural networks; neuromanifold; probability distributions; statistical physical method; stochastic inference; stochastic reasoning; Artificial intelligence; Belief propagation; Biological neural networks; Fluctuations; Information geometry; Learning; Stochastic processes; Stochastic resonance; Stochastic systems; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN
981-04-7524-1
Type
conf
DOI
10.1109/ICONIP.2002.1198187
Filename
1198187
Link To Document