DocumentCode
721227
Title
Data clustering approaches survey and analysis
Author
Ahalya, G. ; Pandey, Hari Mohan
Author_Institution
CSE Dept., Amity Univ., Noida, India
fYear
2015
fDate
25-27 Feb. 2015
Firstpage
532
Lastpage
537
Abstract
In the current world, there is a need to analyze and extract information from data. Clustering is one such analytical method which involves the distribution of data into groups of identical objects. Every group is known as a cluster, which consists of objects that have affinity within the cluster and disparity with the objects in other groups. This paper is intended to examine and evaluate various data clustering algorithms. The two major categories of clustering approaches are partition and hierarchical clustering. The algorithms which are dealt here are: k-means clustering algorithm, hierarchical clustering algorithm, density based clustering algorithm, self-organizing map algorithm, and expectation maximization clustering algorithm. All the mentioned algorithms are explained and analyzed based on the factors like the size of the dataset, type of the data set, number of clusters created, quality, accuracy and performance. This paper also provides the information about the tools which are used to implement the clustering approaches. The purpose of discussing the various software/tools is to make the beginners and new researchers to understand the working, which will help them to come up with new product and approaches for the improvement.
Keywords
expectation-maximisation algorithm; pattern clustering; self-organising feature maps; data clustering algorithms; data distribution; density based clustering algorithm; expectation maximization clustering algorithm; hierarchical clustering algorithm; k-means clustering algorithm; self-organizing map algorithm; Accuracy; Algorithm design and analysis; Clustering algorithms; Noise; Partitioning algorithms; Software; Software algorithms; Clustering; Expectation maximization clustering algorithm; Hierarchical clustering; K-means clustering algorithm; Self-organization maps algorithm; density based clustering algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Futuristic Trends on Computational Analysis and Knowledge Management (ABLAZE), 2015 International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-8432-9
Type
conf
DOI
10.1109/ABLAZE.2015.7154919
Filename
7154919
Link To Document