Title :
A novel highly scalable clustering algorithm based on hyper edges and successive merging with randomization for complex data sets
Author :
Ghosh, Arindam ; Ghosh, Ruma ; Mukherjee, Debaprasad
Author_Institution :
Dept. of CSE & Dept. of IT., Dr. B.C. Roy Eng. Coll., Durgapur, India
Abstract :
Classification through clustering of complex data sets is a fundamental problem in computer science, and it has various applications in the fields of biomedicai sciences, weather prediction, web search, security and surveillance, information retrieval etc. Several algorithms are available on this area. Here, we develop a new algorithm for clustering of graph structured vector valued data, for final classification of the data into proper meaningful groups. In this algorithm, hyperedges are created where sets of consecutive edges based on similarity of feature vectors of nodes are created and successively merged. Furthermore, several randomization steps have been incorporated to overcome any errors in the classification due to bias in the input data set. We have justified the validity, accuracy and performance of the algorithm through analysis and preliminary simulations. Our analysis and simulations indicate a satisfactory outcome and provide support to our inferences.
Keywords :
data handling; pattern classification; pattern clustering; clustering algorithm; complex data sets; consecutive edges; data classification; graph structured vector valued data; hyper edges; hyperedges; randomization steps; successive merging; Classification algorithms; Clustering algorithms; Computers; Data models; Inference algorithms; Merging; Support vector machine classification; clustering; feature vectors; hyperedges; randomization;
Conference_Titel :
Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on
Print_ISBN :
978-1-4799-2395-3
DOI :
10.1109/ICCPCT.2014.7054818