Title :
Representation and generalization properties of class-entropy networks
Author :
Ridella, Sandro ; Rovetta, Stefano ; Zunino, Rodolfo
Author_Institution :
Dept. of Biophys. & Electron. Eng., Genoa Univ., Italy
fDate :
1/1/1999 12:00:00 AM
Abstract :
Using conditional class entropy (CCE) as a cost function allows feedforward networks to fully exploit classification-relevant information. CCE-based networks arrange the data space into partitions, which are assigned unambiguous symbols and are labeled by class information. By this labeling mechanism the network can model the empirical data distribution at the local level. Region labeling evolves with the network-training process, which follows a plastic algorithm. The paper proves several theoretical properties about the performance of CCE-based networks, and considers both convergence during training and generalization ability at run-time. In addition, analytical criteria and practical procedures are proposed to enhance the generalization performance of the trained networks. Experiments on artificial and real-world domains confirm the accuracy of this class of networks and witness the validity of the described methods
Keywords :
convergence; feedforward neural nets; generalisation (artificial intelligence); learning (artificial intelligence); minimum entropy methods; pattern classification; analytical criteria; class-entropy networks; classification-relevant information; conditional class entropy; convergence; empirical data distribution; generalization; plastic algorithm; region labeling; representation; Convergence; Cost function; Entropy; Labeling; Neurons; Partitioning algorithms; Performance analysis; Plastics; Runtime; Testing;
Journal_Title :
Neural Networks, IEEE Transactions on