Title : 
A Short Survey on Data Clustering Algorithms
         
        
        
            Author_Institution : 
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon Tong, China
         
        
        
        
        
            Abstract : 
With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains, for instance, bioinformatics, speech recognition, and financial analysis. Formally speaking, given a set of data instances, a clustering algorithm is expected to divide the set of data instances into the subsets which maximize the intra-subset similarity and inter-subset dissimilarity, where a similarity measure is defined beforehand. In this work, the state-of-the-arts clustering algorithms are reviewed from design concept to methodology, Different clustering paradigms are discussed. Advanced clustering algorithms are also discussed. After that, the existing clustering evaluation metrics are reviewed. A summary with future insights is provided at the end.
         
        
            Keywords : 
"Clustering algorithms","Clustering methods","Indexes","Algorithm design and analysis","Correlation","Benchmark testing","Bioinformatics"
         
        
        
            Conference_Titel : 
Soft Computing and Machine Intelligence (ISCMI), 2015 Second International Conference on
         
        
        
            DOI : 
10.1109/ISCMI.2015.10