Title : 
Hierarchical Document Clustering Using Fuzzy Association Rule Mining
         
        
            Author : 
Chen, Chun-Ling ; Tseng, Frank S C ; Liang, Tyne
         
        
            Author_Institution : 
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu
         
        
        
        
        
        
            Abstract : 
In this paper, we will present an effective Fuzzy Frequent Itemset-Based Hierarchical Clustering (F2IHC) approach, which uses fuzzy frequent itemsets discovered by fuzzy association rule mining to improve the clustering accuracy of FIHC (Frequent Itemset-Based Hierarchical Clustering) method. Our approach can alleviate the deficiencies of most of the traditional document clustering methods in dealing with the problems of high dimensionality, large data size, and meaningful cluster labels. We have conducted experiments to evaluate our approach on Reuters 21578 dataset. The experimental results show that our approach not only absolutely retains the merits of FIHC, but also improves the document clustering accuracy quality as compared with the FIHC method.
         
        
            Keywords : 
data mining; document handling; fuzzy set theory; pattern clustering; FIHC method; fuzzy association rule mining; fuzzy frequent itemset-based hierarchical clustering; hierarchical document clustering; Association rules; Clustering algorithms; Clustering methods; Data mining; Frequency; Fuzzy sets; Itemsets; Scalability; Text processing; Tree data structures;
         
        
        
        
            Conference_Titel : 
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
         
        
            Conference_Location : 
Dalian, Liaoning
         
        
            Print_ISBN : 
978-0-7695-3161-8
         
        
            Electronic_ISBN : 
978-0-7695-3161-8
         
        
        
            DOI : 
10.1109/ICICIC.2008.305