Title :
A Short Survey on Data Clustering Algorithms
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon Tong, China
Abstract :
With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains, for instance, bioinformatics, speech recognition, and financial analysis. Formally speaking, given a set of data instances, a clustering algorithm is expected to divide the set of data instances into the subsets which maximize the intra-subset similarity and inter-subset dissimilarity, where a similarity measure is defined beforehand. In this work, the state-of-the-arts clustering algorithms are reviewed from design concept to methodology, Different clustering paradigms are discussed. Advanced clustering algorithms are also discussed. After that, the existing clustering evaluation metrics are reviewed. A summary with future insights is provided at the end.
Keywords :
"Clustering algorithms","Clustering methods","Indexes","Algorithm design and analysis","Correlation","Benchmark testing","Bioinformatics"
Conference_Titel :
Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
DOI :
10.1109/ISCMI.2015.10