DocumentCode
2313631
Title
Information geometry on pruning of neural network
Author
Liu, Yun-Hui ; Luo, Si-Wel ; Li, Ai-jun ; Yu, Han-Bin
Author_Institution
Dept. of Comput. Sci., Beijing Jiaotong Univ., China
Volume
6
fYear
2004
fDate
26-29 Aug. 2004
Firstpage
3479
Abstract
The problem of determining the proper size of an artificial neural network is recognized to be crucial. One popular approach is pruning which means training a larger than necessary network and removing unnecessary weights/nodes. Though pruning is commonly used in architecture learning of neural network, there is still no theoretical framework about it. We give an information geometric explanation of pruning. In information geometric framework, most kinds of neural networks form exponential or mixture manifold which has a natural hierarchical structure. In a hierarchical set of systems, a lower order system is included in the parameter space of a large one as a submanifold. Such a parameter space has rich geometrical structures that are responsible for the dynamic behaviors of learning. The pruning problem is formulated in iterative m-projections from the current manifold to its submanifold in which the divergence between the two manifolds is minimized, and it means meaning the network performance does not worsen over the entire pruning process. The result gives a geometric understanding and an information geometric guideline of pruning, which has more authentic theoretic foundation.
Keywords
artificial intelligence; information theory; iterative methods; neural net architecture; architecture learning; artificial neural network; exponential manifold; information geometry; iterative m-projections; mixture manifold; pruning; Artificial neural networks; Computer science; Electronic mail; Guidelines; Information geometry; Information theory; Mathematical programming; Neural networks; Probability distribution; Solid modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN
0-7803-8403-2
Type
conf
DOI
10.1109/ICMLC.2004.1380390
Filename
1380390
Link To Document