Title :
Robust neural gas for the analysis of data with outliers
Author :
Allende, Héctor ; Rogel, Cristian ; Moreno, Sebastián ; Salas, Rodrgo
Author_Institution :
Univ. Tecnica Federico Santa Maria, Valparaiso, Chile
Abstract :
Learning the structure of real world data is difficult both to recognize and describe. The structure may contain high dimensional clusters that are related in complex ways. Furthermore, real data sets may contain several outliers. Vector quantization techniques has been successfully applied as a data mining tool. In particular the neural gas (NG) is a variant of the self organizing map (SOM) where the neighborhoods are adaptively defined during training through the ranking order of the distance of prototypes from the given training sample. Unfortunately, the learning algorithm of the NG is sensitive to the presence of outliers as we show in this paper. Due to the influence of the outliers in the learning process, the topology of the employed network does not conserve the topology of the manifold of the data which is presented. In this paper, we propose to robustify the learning algorithm where the parameter estimation process is resistant to the presence of outliers in the data. We call this algorithm robust neural gas (RNG). We illustrate our technique on synthetic and real data sets.
Keywords :
data analysis; data mining; learning (artificial intelligence); parameter estimation; self-organising feature maps; artificial neural networks; data mining; high dimensional clusters; parameter estimation; robust learning algorithm; robust neural gas; self organizing map; vector quantization; Data analysis; Data mining; Databases; Network topology; Neural networks; Organizing; Prototypes; Robustness; Telecommunication network topology; Vector quantization; Artificial Neural Networks; Data Mining; Neural Gas; Robust Learning Algorithm;
Conference_Titel :
Computer Science Society, 2004. SCCC 2004. 24th International Conference of the Chilean
Print_ISBN :
0-7695-2185-1
DOI :
10.1109/QEST.2004.18