Title :
Hierarchical Document Clustering Using Fuzzy Association Rule Mining
Author :
Chen, Chun-Ling ; Tseng, Frank S C ; Liang, Tyne
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu
Abstract :
In this paper, we will present an effective Fuzzy Frequent Itemset-Based Hierarchical Clustering (F2IHC) approach, which uses fuzzy frequent itemsets discovered by fuzzy association rule mining to improve the clustering accuracy of FIHC (Frequent Itemset-Based Hierarchical Clustering) method. Our approach can alleviate the deficiencies of most of the traditional document clustering methods in dealing with the problems of high dimensionality, large data size, and meaningful cluster labels. We have conducted experiments to evaluate our approach on Reuters 21578 dataset. The experimental results show that our approach not only absolutely retains the merits of FIHC, but also improves the document clustering accuracy quality as compared with the FIHC method.
Keywords :
data mining; document handling; fuzzy set theory; pattern clustering; FIHC method; fuzzy association rule mining; fuzzy frequent itemset-based hierarchical clustering; hierarchical document clustering; Association rules; Clustering algorithms; Clustering methods; Data mining; Frequency; Fuzzy sets; Itemsets; Scalability; Text processing; Tree data structures;
Conference_Titel :
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3161-8
Electronic_ISBN :
978-0-7695-3161-8
DOI :
10.1109/ICICIC.2008.305