• DocumentCode
    3353459
  • Title

    A Fuzzy Similarity-Based Approach for Multi-label Document Classification

  • Author

    Tsai, Shian-Chi ; Jiang, Jung-Yi ; Wu, ChunDer ; Lee, Shie-Jue

  • Author_Institution
    Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • Volume
    2
  • fYear
    2009
  • fDate
    28-30 Oct. 2009
  • Firstpage
    59
  • Lastpage
    63
  • Abstract
    Multi-label document classification concerns the determination of categories in the situation where one document may belong to more than one category. In this paper we propose a fuzzy similarity-based approach for multi-label document classification. For a test document, the scores of its relevance to the classes are calculated based on a modified fuzzy similarity measure. The test document is then decided to belong to every class whose score passes a threshold. To make the system adaptive, we provide a heuristic approach to find a score threshold automatically for each class. Experimental results show that our proposed method is more effective and efficient than other existing methods.
  • Keywords
    document handling; information retrieval; fuzzy similarity-based approach; multilabel document classification; Bayesian methods; Bibliographies; Computer industry; Computer science; Fuzzy sets; Information retrieval; Machine learning; Motion pictures; Supervised learning; Testing; Multi-label document classification; fuzzy similarity measure; information retrieval; relevance score;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Engineering, 2009. WCSE '09. Second International Workshop on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-0-7695-3881-5
  • Type

    conf

  • DOI
    10.1109/WCSE.2009.766
  • Filename
    5403378