Title :
Effect of a sparse architecture on generalization behaviour of connectionist networks: a comparative study
Author :
Chakraborty, Basabi
Author_Institution :
Fac. of Software & Inf. Sci., Iwate Prefectural Univ., Japan
fDate :
6/21/1905 12:00:00 AM
Abstract :
Generalization, the ability of achieving equal performance with respect to the training patterns for the design of the system as well as the unknown test patterns outside the training set, is considered to be the most desirable aspect of a cognitive learning system. For connectionist classifiers, generalization depends on several factors like the network architecture and size, learning algorithm, complexity of the problem and the quality and quantity of the training samples. Various studies on the generalization behaviour of a neural classifier suggest that the architecture and the size of the network should match the size and complexity of the training sample set of the particular problem. The popular ideas for improving generalization is network pruning by removing redundant nodes or growing by adding nodes to reach the optimum size for matching. In this work a feedforward multilayer connectionist model proposed earlier by Chakraborty et al. (1997) with a sparse fractal connection structure between the neurons of adjacent layers has been studied for its suitability compared to other architectures for achieving good generalization. A comparative study with another feedforward multilayer model with network pruning algorithms for pattern classification problem revealed that it is easier to achieve good generalization with the proposed sparse fractal architecture
Keywords :
feedforward neural nets; generalisation (artificial intelligence); learning (artificial intelligence); multilayer perceptrons; neural net architecture; pattern classification; cognitive learning system; complexity; connectionist classifiers; connectionist networks; feedforward multilayer connectionist model; generalization behaviour; network pruning; redundant nodes; sparse architecture; sparse fractal connection structure; training patterns; unknown test patterns; Computer architecture; Fractals; Information science; Learning systems; Neurons; Nonhomogeneous media; Pattern classification; Software testing; System testing;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.823241