• DocumentCode
    1657499
  • Title

    Automatic text categorization based on Jensen-Shannon Divergence

  • Author

    Li, Xiangdong ; Sun, Denghui ; Wuhan, Li Huang

  • Author_Institution
    The School of Information Management, Wuhan University, Wuhan 430072, China
  • fYear
    2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    This paper studies the principle of text categorization in which Jensen-Shannon Divergence is used to calculate text similarity, comparing its accuracy of classification and time taking to the traditional Cosine Similarity algorithm. Experimental research shows that Jensen-Shannon Divergence algorithm will reach better results when test materials remain unchanged.
  • Keywords
    Classification algorithms; Information retrieval; Libraries; Publishing; Research and development; Sun; Text categorization; Jensen-Shannon divergence; KNN algorithm; cosine similarity; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    E -Business and E -Government (ICEE), 2011 International Conference on
  • Conference_Location
    Shanghai, China
  • Print_ISBN
    978-1-4244-8691-5
  • Type

    conf

  • DOI
    10.1109/ICEBEG.2011.5882549
  • Filename
    5882549