Title :
Overtraining and model selection with the self-organizing map
Author :
Lampinen, Jouko ; Kostiainen, Timo
Author_Institution :
Lab. of Comput. Eng., Helsinki Univ. of Technol., Espoo, Finland
Abstract :
We discuss the importance of finding the correct model complexity and regularization level, in the self-organizing map (SOM) algorithm. The complexity of the SOM is determined mainly by the width of the final neighborhood, which is usually chosen ad hoc or set to zero for optimal quantization error. However, if the SOM is used for visualizing the joint probability distribution of the data, then care must be taken not to overfit the model to the data sample, similarly as with any statistical model. We propose a heuristic criterion for model selection in SOM, and demonstrate by simulations that the criterion can be used for selecting the neighborhood that suppresses overfitting
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); probability; self-organising feature maps; final neighborhood; heuristic criterion; joint probability distribution; model complexity; model selection; optimal quantization error; overfitting suppression; overtraining; regularization level; self-organizing map; Bayesian methods; Data visualization; Kernel; Laboratories; Lattices; Nearest neighbor searches; Probability distribution; Quantization; Smoothing methods; Training data;
Conference_Titel :
Neural Networks, 1999. IJCNN '99. International Joint Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-5529-6
DOI :
10.1109/IJCNN.1999.832673