DocumentCode :
3227296
Title :
A merging and splitting algorithm based on mutual information for design neural networks
Author :
Zhang, Zhaozhao ; Chen, Qili ; Qiao, Junfei
Author_Institution :
Coll. of Electron. & Control Eng., Beijing Univ. of Technol., Beijing, China
fYear :
2010
fDate :
23-26 Sept. 2010
Firstpage :
1268
Lastpage :
1272
Abstract :
This paper presents a new algorithm based on the theory of mutual information and the relationship between the neural networks´ architecture and learning capacity, called adaptive merging and splitting algorithm (AM SA), in designing optima feedforward neural networks (FNNs). This algorithm merges and splits hidden neurons during the training process of FNNs. Unlike most previous studies, AMSA puts emphasis on autonomous functioning in the design process of FNNs. This is the main reason why AMSA uses an adaptive not a predefined fixed strategy in designing FNNs. The adaptive strategy merges or splits hidden neurons based on the learning ability of hidden neurons or the training progress of FNNs. In order to reduce the amount of retraining after modifying FNN architectures, AMSA prunes hidden neurons by merging correlated hidden neurons and adds hidden neurons by splitting existing hidden neurons. In merge operation, the hidden neurons is merged based on mutual information criterion, resulting in compact neural networks architecture without decreasing the information processing capacity. In split operation, a new hidden neuron is added by splitting a hidden neuron in current neural networks, resulting in improve the information processing capacity. AMSA has been tested on the regression problem. The experimental results show that AMSA can design compact FNNs architectures with good generalization ability compared to other algorithm.
Keywords :
feedforward neural nets; learning (artificial intelligence); regression analysis; adaptive merging algorithm; information processing capacity; learning capacity; mutual information criterion; neural networks architecture; optima feedforward neural networks; regression problem; splitting algorithm; Bayesian methods; Adaptive search strategy; Architecture design; Feedforward neural network (FNN); Mutual information (MI);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bio-Inspired Computing: Theories and Applications (BIC-TA), 2010 IEEE Fifth International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-6437-1
Type :
conf
DOI :
10.1109/BICTA.2010.5645080
Filename :
5645080
Link To Document :
بازگشت