DocumentCode :
593953
Title :
Using Incremental Addition to Evaluate the Grouping Quality of Document Classification
Author :
Yi-Jen Su ; Chuan-Wang Chang ; Jian-Cheng Wun ; Wei-Lin Hsu ; Yue-Qun Chen
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Shu-Te Univ., Kaohsiung, Taiwan
fYear :
2012
fDate :
25-28 Aug. 2012
Firstpage :
380
Lastpage :
383
Abstract :
Most previous research on document classification assigned only one or two category tags to each document. Furthermore, the tagged items are rarely incorporated into their topic groups in subsequent classification work though it would conceivably enhance classification efficiency. with the modularity method, this research incrementally adds the classified documents to their topic groups after the recognition process to examine the changes in grouping quality. the result shows that social network analysis demonstrates great potential for automatic document classification, especially in identifying citation networks embedded in research papers and reference lists. a modified TF-IDF technique calculates the weight of each keyword in the topic groups. All the papers under study are collected from three journals in IEEE Computer Society collection published from 1979 to 2011.
Keywords :
data analysis; document handling; pattern classification; TF-IDF technique; classification efficiency; document category tag; document classification; grouping quality; incremental addition; modularity method; recognition process; social network analysis; term frequency-inverse document frequency; Citation analysis; Cities and towns; Classification algorithms; Communities; Educational institutions; IEEE Computer Society; Social network services; Automatic Document Classification; Citation Analysis; Social Network Analysis; TF-IDF;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genetic and Evolutionary Computing (ICGEC), 2012 Sixth International Conference on
Conference_Location :
Kitakushu
Print_ISBN :
978-1-4673-2138-9
Type :
conf
DOI :
10.1109/ICGEC.2012.148
Filename :
6457282
Link To Document :
بازگشت