Title :
A Fuzzy Similarity-Based Approach for Multi-label Document Classification
Author :
Tsai, Shian-Chi ; Jiang, Jung-Yi ; Wu, ChunDer ; Lee, Shie-Jue
Author_Institution :
Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
Abstract :
Multi-label document classification concerns the determination of categories in the situation where one document may belong to more than one category. In this paper we propose a fuzzy similarity-based approach for multi-label document classification. For a test document, the scores of its relevance to the classes are calculated based on a modified fuzzy similarity measure. The test document is then decided to belong to every class whose score passes a threshold. To make the system adaptive, we provide a heuristic approach to find a score threshold automatically for each class. Experimental results show that our proposed method is more effective and efficient than other existing methods.
Keywords :
document handling; information retrieval; fuzzy similarity-based approach; multilabel document classification; Bayesian methods; Bibliographies; Computer industry; Computer science; Fuzzy sets; Information retrieval; Machine learning; Motion pictures; Supervised learning; Testing; Multi-label document classification; fuzzy similarity measure; information retrieval; relevance score;
Conference_Titel :
Computer Science and Engineering, 2009. WCSE '09. Second International Workshop on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-3881-5
DOI :
10.1109/WCSE.2009.766