DocumentCode :
2637038
Title :
Hierarchical Document Clustering Using Fuzzy Association Rule Mining
Author :
Chen, Chun-Ling ; Tseng, Frank S C ; Liang, Tyne
Author_Institution :
Dept. of Comput. Sci., Nat. Chiao Tung Univ., Hsinchu
fYear :
2008
fDate :
18-20 June 2008
Firstpage :
326
Lastpage :
326
Abstract :
In this paper, we will present an effective Fuzzy Frequent Itemset-Based Hierarchical Clustering (F2IHC) approach, which uses fuzzy frequent itemsets discovered by fuzzy association rule mining to improve the clustering accuracy of FIHC (Frequent Itemset-Based Hierarchical Clustering) method. Our approach can alleviate the deficiencies of most of the traditional document clustering methods in dealing with the problems of high dimensionality, large data size, and meaningful cluster labels. We have conducted experiments to evaluate our approach on Reuters 21578 dataset. The experimental results show that our approach not only absolutely retains the merits of FIHC, but also improves the document clustering accuracy quality as compared with the FIHC method.
Keywords :
data mining; document handling; fuzzy set theory; pattern clustering; FIHC method; fuzzy association rule mining; fuzzy frequent itemset-based hierarchical clustering; hierarchical document clustering; Association rules; Clustering algorithms; Clustering methods; Data mining; Frequency; Fuzzy sets; Itemsets; Scalability; Text processing; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovative Computing Information and Control, 2008. ICICIC '08. 3rd International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-0-7695-3161-8
Electronic_ISBN :
978-0-7695-3161-8
Type :
conf
DOI :
10.1109/ICICIC.2008.305
Filename :
4603515
Link To Document :
بازگشت