DocumentCode
3574270
Title
A novel highly scalable clustering algorithm based on hyper edges and successive merging with randomization for complex data sets
Author
Ghosh, Arindam ; Ghosh, Ruma ; Mukherjee, Debaprasad
Author_Institution
Dept. of CSE & Dept. of IT., Dr. B.C. Roy Eng. Coll., Durgapur, India
fYear
2014
Firstpage
1507
Lastpage
1511
Abstract
Classification through clustering of complex data sets is a fundamental problem in computer science, and it has various applications in the fields of biomedicai sciences, weather prediction, web search, security and surveillance, information retrieval etc. Several algorithms are available on this area. Here, we develop a new algorithm for clustering of graph structured vector valued data, for final classification of the data into proper meaningful groups. In this algorithm, hyperedges are created where sets of consecutive edges based on similarity of feature vectors of nodes are created and successively merged. Furthermore, several randomization steps have been incorporated to overcome any errors in the classification due to bias in the input data set. We have justified the validity, accuracy and performance of the algorithm through analysis and preliminary simulations. Our analysis and simulations indicate a satisfactory outcome and provide support to our inferences.
Keywords
data handling; pattern classification; pattern clustering; clustering algorithm; complex data sets; consecutive edges; data classification; graph structured vector valued data; hyper edges; hyperedges; randomization steps; successive merging; Classification algorithms; Clustering algorithms; Computers; Data models; Inference algorithms; Merging; Support vector machine classification; clustering; feature vectors; hyperedges; randomization;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on
Print_ISBN
978-1-4799-2395-3
Type
conf
DOI
10.1109/ICCPCT.2014.7054818
Filename
7054818
Link To Document