Title :
Evaluation framework of hierarchical clustering methods for binary data
Author :
Tamasauskas, Darius ; Sakalauskas, V. ; Kriksciuniene, D.
Author_Institution :
Dept. of Finance, Nordea Bank Finland Pic, Finland
Abstract :
The article aims to evaluate hierarchical clustering methods according to their performance for binary data type. We explore the accuracy of ten hierarchical clustering methods by experimenting with ten different distance measures. The three types of well, poorly and very poorly separated clusters of binary data sets are generated by selecting the appropriate parameters for binomial distribution and Monte Carlo method. In order to evaluate the precision of clustering methods the binary data sets are transformed to distance matrices. The error level each method is explored in relationship to distance measures, cluster types and data distributions. The Complete linkage, Flexible-beta and Ward´s methods have best clustering performance for the case of two well separated clusters of binary data.
Keywords :
Monte Carlo methods; binomial distribution; matrix algebra; pattern clustering; Monte Carlo method; Ward methods; binary data set cluster; binary data type; binomial distribution; cluster types; complete linkage; data distributions; different distance measures; distance matrices; evaluation framework; flexible-beta; hierarchical clustering methods; FCC; Frequency modulation; Hybrid intelligent systems; IP networks; Integrated circuits; Iron; Cluster analysis; Monte Carlo simulation; binary data; distance matrix; hierarchical clustering;
Conference_Titel :
Hybrid Intelligent Systems (HIS), 2012 12th International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4673-5114-0
DOI :
10.1109/HIS.2012.6421371