Title :
Hybrid clustering with application to Web mining
Author_Institution :
Sch. of Software Eng. & Data Commun., Queensland Univ. of Technol., Brisbane, Qld., Australia
Abstract :
Clustering algorithms fall into two categories: hierarchical clustering and partitional clustering. For hierarchical algorithms, they are static in the sense that they never undo what was done previously, which means that, objects which are committed to a cluster in the early stages, cannot move to another cluster. Partitional clustering does not suffer from this problem, but requires a pre-specified number for the output clusters. This paper presents a hybrid clustering method that combines the advantages of hierarchical clustering and partitional clustering techniques. The proposed hybrid algorithm does not require a number for the output clusters prior to the clustering and the clusters can be rearranged according to a quality measurement. In the present paper, we apply this method to Web page clustering and provide necessary experimental results.
Keywords :
Internet; data mining; pattern clustering; Web mining; Web page clustering; hierarchical clustering algorithm; partitional clustering algorithm; quality measurement; Application software; Clustering algorithms; Clustering methods; Data communication; Merging; Partitioning algorithms; Software algorithms; Software engineering; Web mining; Web pages;
Conference_Titel :
Active Media Technology, 2005. (AMT 2005). Proceedings of the 2005 International Conference on
Print_ISBN :
0-7803-9035-0
DOI :
10.1109/AMT.2005.1505425