A novel highly scalable clustering algorithm based on hyper edges and successive merging with randomization for complex data sets

Author

Ghosh, Arindam ; Ghosh, Ruma ; Mukherjee, Debaprasad

Author_Institution

Dept. of CSE & Dept. of IT., Dr. B.C. Roy Eng. Coll., Durgapur, India

fYear

2014

Firstpage

1507

Lastpage

1511

Abstract

Classification through clustering of complex data sets is a fundamental problem in computer science, and it has various applications in the fields of biomedicai sciences, weather prediction, web search, security and surveillance, information retrieval etc. Several algorithms are available on this area. Here, we develop a new algorithm for clustering of graph structured vector valued data, for final classification of the data into proper meaningful groups. In this algorithm, hyperedges are created where sets of consecutive edges based on similarity of feature vectors of nodes are created and successively merged. Furthermore, several randomization steps have been incorporated to overcome any errors in the classification due to bias in the input data set. We have justified the validity, accuracy and performance of the algorithm through analysis and preliminary simulations. Our analysis and simulations indicate a satisfactory outcome and provide support to our inferences.

Keywords

data handling; pattern classification; pattern clustering; clustering algorithm; complex data sets; consecutive edges; data classification; graph structured vector valued data; hyper edges; hyperedges; randomization steps; successive merging; Classification algorithms; Clustering algorithms; Computers; Data models; Inference algorithms; Merging; Support vector machine classification; clustering; feature vectors; hyperedges; randomization;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on

Print_ISBN

978-1-4799-2395-3

Type

conf

DOI

10.1109/ICCPCT.2014.7054818

Filename

7054818