DocumentCode :
2897868
Title :
Unsupervised document clustering based on keyword clusters
Author :
Chang, Hsi-Cheng ; Hsu, Chiun-Chieh ; Deng, Yi-Wen
Author_Institution :
Dept. of Electron. Eng., Hwa Hsia Coll. of Technol. & Commerce, Taipei, Taiwan
Volume :
2
fYear :
2004
fDate :
26-29 Oct. 2004
Firstpage :
1198
Abstract :
Due to the explosion growth of digital information, automatic document clustering or categorization has been an important research topic. Since document clustering has high dimension, the magnitude of the representation features will influence the efficiency and effect of the clustering and the precision of the clustering results. This paper presents an unsupervised document clustering method based on partitioning a weighted undirected graph. It initially discovers a set of tightly relevant keyword clusters that are disposed throughout the feature space of the collection of documents, and further clusters the documents into document clusters by using these keyword clusters. The experimental results show that the proposed approach can efficiently produce higher quality document clustering as compared with several well-known document clustering algorithms.
Keywords :
document handling; graph theory; pattern clustering; relevance feedback; automatic document categorization; keyword clusters; tightly relevant keyword clusters; unsupervised document clustering; weighted undirected graph partitioning; Business; Cities and towns; Clustering algorithms; Clustering methods; Educational institutions; Explosions; Information management; Information retrieval; Internet; Organizing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technology, 2004. ISCIT 2004. IEEE International Symposium on
Print_ISBN :
0-7803-8593-4
Type :
conf
DOI :
10.1109/ISCIT.2004.1413908
Filename :
1413908
Link To Document :
بازگشت