DocumentCode
1797265
Title
A new weight initialization method for sigmoidal feedforward artificial neural networks
Author
Sodhi, Sartaj Singh ; Chandra, P. ; Tanwar, Sudeep
Author_Institution
Sch. of Inf. & Commun. Technol., Guru Gobind Singh Indraprastha Univ., New Delhi, India
fYear
2014
fDate
6-11 July 2014
Firstpage
291
Lastpage
298
Abstract
Initial weight choice has been recognized as an important aspect of the training methodology for sigmoidal feedforward neural networks. In this paper, a new mechanism for weight initialization is proposed. The mechanism distributes the initial input to output weights such that all weights (including thresholds) leading into a hidden layer are uniformly distributed in a region, and the centers of the regions from which the weights are sampled are chosen so that no two distinct hidden nodes have overlapping regions. The proposed method is compared against random weight initialization routines on five function approximation tasks using the Resilient Backpropagation (RPROP) algorithm for training. The proposed method is shown to converge to a pre-specified training goal about twice as fast as any of the random weight initialization methods. Moreover, it is shown that, at least for these problems, the networks reach a deeper minimum of the error functional during training and generalize better than networks whose weights were initialized by random weight initialization methods.
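The abstract does not give the exact construction, so the following is only a minimal sketch of one possible reading of the idea, not the authors' method. It assumes the sampling regions are equal-width sub-intervals of a one-dimensional range [-bound, bound], one per hidden node; the function name `init_hidden_weights` and the parameters `bound` and `seed` are hypothetical.

```python
import random


def init_hidden_weights(n_inputs, n_hidden, bound=1.0, seed=None):
    """Return one list of weights per hidden node (n_inputs weights + 1 threshold).

    Assumption (not from the paper): node j samples uniformly from the j-th
    equal-width sub-interval of [-bound, bound], so the sampling regions of
    two distinct hidden nodes share at most an endpoint.
    """
    rng = random.Random(seed)
    width = 2.0 * bound / n_hidden  # width of each node's sampling interval
    weights = []
    for j in range(n_hidden):
        low = -bound + j * width
        high = low + width
        # n_inputs incoming weights plus one threshold, all from the same region
        weights.append([rng.uniform(low, high) for _ in range(n_inputs + 1)])
    return weights


# Example: 2 inputs, 4 hidden nodes, regions of width 0.5 inside [-1, 1]
hidden = init_hidden_weights(n_inputs=2, n_hidden=4, bound=1.0, seed=0)
```

By contrast, a plain random initialization would draw every weight from the same interval, so the regions of all hidden nodes would coincide.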
Keywords
backpropagation; feedforward neural nets; function approximation; FFANN; RPROP algorithm; deeper error functional minima; function approximation tasks; initial weight choice; random weight initialization method; resilient backpropagation algorithm; sigmoidal feedforward artificial neural networks; training methodology; Feedforward neural networks; Function approximation; Measurement uncertainty; Training; Training data;
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6627-1
Type
conf
DOI
10.1109/IJCNN.2014.6889373
Filename
6889373