DocumentCode :
3108365
Title :
An improved MST-based clustering for biological data
Author :
Edla, D.R. ; Machavarapu, S. ; Jana, Prasanta K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Sch. of Mines, Dhanbad, India
fYear :
2012
fDate :
18-20 July 2012
Firstpage :
42
Lastpage :
47
Abstract :
Graph-based methods are widely used in cluster analysis because of their efficient functionality in a wide variety of problem domains. In this paper, we propose a new clustering algorithm with the help of minimum spanning tree. We remove the inconsistent edges of the MST by defining an error ratio based on the weights of the edges sorted in non-increasing order. Edges are removed until the error ratio is greater than certain value. An important advantage involved in this method is that the same threshold ratio is used for all the data sets. The proposed method is experimented on various two-dimensional synthetic and multi-dimensional real world data sets to prove the efficiency in detecting the complex clusters. Dynamic validity index is used to evaluate the clustering results of multi-dimensional data. The results of the proposed method are also compared with few existing clustering techniques. The results are encouraging.
Keywords :
biology computing; pattern clustering; trees (mathematics); MST-based clustering; biological data; cluster analysis; complex clusters; dynamic validity index; error ratio; graph-based methods; minimum spanning tree; real world data sets; sorted edges; threshold ratio; Algorithm design and analysis; Biology; Clustering algorithms; Image edge detection; Indexes; Partitioning algorithms; Shape; Clustering; biological data; dynamic validity index; minimum spanning tree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Science & Engineering (ICDSE), 2012 International Conference on
Conference_Location :
Cochin, Kerala
Print_ISBN :
978-1-4673-2148-8
Type :
conf
DOI :
10.1109/ICDSE.2012.6282259
Filename :
6282259
Link To Document :
بازگشت