DocumentCode
1657499
Title
Automatic text categorization based on Jensen-Shannon Divergence
Author
Li, Xiangdong ; Sun, Denghui ; Wuhan, Li Huang
Author_Institution
The School of Information Management, Wuhan University, Wuhan 430072, China
fYear
2011
Firstpage
1
Lastpage
4
Abstract
This paper studies the principle of text categorization in which Jensen-Shannon Divergence is used to calculate text similarity, comparing its accuracy of classification and time taking to the traditional Cosine Similarity algorithm. Experimental research shows that Jensen-Shannon Divergence algorithm will reach better results when test materials remain unchanged.
Keywords
Classification algorithms; Information retrieval; Libraries; Publishing; Research and development; Sun; Text categorization; Jensen-Shannon divergence; KNN algorithm; cosine similarity; text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
E -Business and E -Government (ICEE), 2011 International Conference on
Conference_Location
Shanghai, China
Print_ISBN
978-1-4244-8691-5
Type
conf
DOI
10.1109/ICEBEG.2011.5882549
Filename
5882549
Link To Document