DocumentCode :
2414430
Title :
Literature Clustering using Citation Semantics
Author :
Tuanjie Tong ; Dinakarpandian, D. ; Yugyung Lee
fYear :
2009
fDate :
5-8 Jan. 2009
Firstpage :
1
Lastpage :
10
Abstract :
Clustering is a common and powerful technique for statistical data analysis, document categorization and topic discovery. The majority of traditional clustering methods, especially for document clustering, are based on the vector space model for distance measure, where the vector is the word profile of a document in the context of the entire corpus. However, algorithms using this measure achieve limited accuracy. In this paper, we propose a semantic measure which incorporates citation semantics (Citonomy) into literature (document) clustering. Our experimental results show that the performance of clustering can be substantially improved by combining Citonomy and vector space measures.
Keywords :
citation analysis; data analysis; document handling; pattern clustering; statistical analysis; citation semantic; distance measurement; document categorization; document clustering; literature clustering; statistical data analysis; topic discovery; vector space model; Clustering algorithms; Clustering methods; Context modeling; Data analysis; Extraterrestrial measurements; Frequency; Iterative algorithms; Ontologies; Partitioning algorithms; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 2009. HICSS '09. 42nd Hawaii International Conference on
Conference_Location :
Big Island, HI
ISSN :
1530-1605
Print_ISBN :
978-0-7695-3450-3
Type :
conf
DOI :
10.1109/HICSS.2009.294
Filename :
4755482
Link To Document :
بازگشت