Information geometry of neural learning and belief propagation

Author

Amari, Shun-Ichi

Volume

fYear

2002

fDate

18-22 Nov. 2002

Abstract

Summary form only given. Many complex systems such as the brain work sufficiently well under stochastic uncertainty and noise fluctuations. Learning and statistical inference play a key role in such systems. Information geometry emerged from studies of the intrinsic structure of the manifold of probability distributions and is applicable to a wide variety of information systems. Information geometry provides a new tool for elucidating the intrinsic structure of such systems. The article gives an understandable introduction to information geometry. We then show how the geometric concepts are applied to learning algorithms in neural networks. The set of neural networks, for example multilayer perceptrons, forms a manifold, where modifiable parameters are regarded as coordinates. This is called the neuromanifold, which is Riemannian, and in which learning takes place. The dynamics of learning is represented by a trajectory in this Riemannian space. By using this example, we explain why the geometrical consideration is important. We further propose a new efficient learning method called the natural gradient method which outperforms the error backpropagation method. Belief propagation is an important method developed in artificial intelligence, where stochastic inference takes place under graphical models. This is a hot topic connecting artificial intelligence, learning, error correcting codes and stochastic reasoning. Information geometry elucidates the intrinsic structure of the belief propagation method, and unifies the AI approach, information coding, statistical approach, and statistical physical method. Such hot topics are also explained.

Keywords

belief maintenance; geometry; gradient methods; learning (artificial intelligence); neural nets; stochastic processes; Riemannian space; belief propagation; graphical models; information coding; information geometry; information systems; learning algorithms; modifiable parameters; multilayer perceptrons; natural gradient method; neural learning; neural networks; neuromanifold; probability distributions; statistical physical method; stochastic inference; stochastic reasoning; Artificial intelligence; Belief propagation; Biological neural networks; Fluctuations; Information geometry; Learning; Stochastic processes; Stochastic resonance; Stochastic systems; Uncertainty;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on

Print_ISBN

981-04-7524-1

Type

conf

DOI

10.1109/ICONIP.2002.1198187

Filename

1198187

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3440652