DocumentCode :
699955
Title :
Unsupervised clustering on multi-component datasets: Applications on images and astrophysics data
Author :
Galluccio, L. ; Michel, O. ; Comon, P.
Author_Institution :
I3S Lab., Univ. of Nice Sophia Antipolis, Nice, France
fYear :
2008
fDate :
25-29 Aug. 2008
Firstpage :
1
Lastpage :
5
Abstract :
This paper proposes an original approach to cluster multicomponent data sets with an estimation of the number of clusters. From the construction of a minimal spanning tree with Prim´s algorithm and the assumption that the vertices are approximately distributed according to a Poisson distribution, the number of clusters is estimated by thresholding the Prim´s trajectory. The corresponding cluster centroids are then computed in order to initialize the Generalized Lloyd´s algorithm, also known as K-means, which allows to circumvent initialization problems. Metrics used for measuring similarity between multi-dimensional data points are based on symmetrical divergences. The use of these informational divergences together with the proposed method lead to better results than some other clustering methods in the framework of astrophysical data processing. An application of this method in the multi-spectral imagery domain with a satellite view of Paris is also presented.
Keywords :
astronomy computing; image processing; Poisson distribution; Prim algorithm; Prim trajectory; astrophysical data processing; cluster multicomponent data sets; clustering methods; generalized Lloyd algorithm; informational divergences; minimal spanning tree; multispectral imagery domain; unsupervised clustering; Clustering algorithms; Extraterrestrial measurements; Signal processing; Signal processing algorithms; Trajectory; Wavelength measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2008 16th European
Conference_Location :
Lausanne
ISSN :
2219-5491
Type :
conf
Filename :
7080487
Link To Document :
بازگشت