Title :
Selection of Cluster Hierarchy Depth in Hierarchical Clustering Using K-Means Algorithm
Author :
Lee, Shinwon ; Lee, Wonhee ; Chung, Sungjong ; An, Dongun ; Bok, Ingeun ; Ryu, Hongjin
Author_Institution :
Chonbuk Nat. Univ., Chonju
Abstract :
Many papers have shown that the hierarchical clustering method takes good-performance, but is limited because of its quadratic time complexity. In contrast, with a large number of variables, K-means has a time complexity that is linear in the number of documents, but is thought to produce inferior clusters. Think of the factor of simplify, high-quality and high-efficiency, we combine the two approaches providing a new system named CONDOR system with hierarchical structure based on document clustering using K-means algorithm. Evaluated the performance on different hierarchy depth and initial uncertain centroid number based on variational relative document amount correspond to given queries. Comparing with regular method that the initial centroids have been established in advance, our method performance has been improved a lot.
Keywords :
computational complexity; document handling; pattern clustering; CONDOR system; K-means algorithm; cluster hierarchy depth; document clustering; hierarchical clustering; time complexity; Clustering algorithms; Clustering methods; Information analysis; Information retrieval; Information technology; Merging; Metasearch; Natural languages; Partitioning algorithms; Speech;
Conference_Titel :
Information Technology Convergence, 2007. ISITC 2007. International Symposium on
Conference_Location :
Joenju
Print_ISBN :
0-7695-3045-1
Electronic_ISBN :
978-0-7695-3045-1
DOI :
10.1109/ISITC.2007.5