Title :
Document Clustering Description Based on Combination Strategy
Author_Institution :
Inst. of Sci. & Tech. Inf. of China, Beijing, China
Abstract :
Document clustering description is the problem of labeling the clusters produced by clustering a document collection. A good description helps users determine whether a cluster is relevant to their information requirements. Labeling a clustered set of documents is therefore an important and challenging task in document clustering applications. The DCF (description comes first) method can generate document clustering descriptions. Because a DCF-based description is generated before the documents are clustered, there is a 'semantic interval' between the clustering description and the cluster central vector. This contradicts the intuition of 'first clustering, second description' and decreases the readability of the clustering description. This paper proposes a method based on a combination strategy, i.e., a combination of DCF and DCL (description comes last), to solve the problem of the weak readability of clustering descriptions. Experimental results show that the method is effective, and the method is used to describe search result clustering.
Keywords :
document handling; pattern clustering; cluster central vector; combination strategy; description comes first method; description comes last method; document clustering description; document collection clustering; Clustering algorithms; Data mining; Frequency; Information management; Labeling;
Conference_Title :
Innovative Computing, Information and Control (ICICIC), 2009 Fourth International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4244-5543-0
DOI :
10.1109/ICICIC.2009.178