• DocumentCode
    14863
  • Title

    Multilabel Text Categorization Based on Fuzzy Relevance Clustering

  • Author

    Shie-Jue Lee ; Jung-Yi Jiang

  • Author_Institution
    Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • Volume
    22
  • Issue
    6
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    1457
  • Lastpage
    1471
  • Abstract
    We propose a fuzzy based method for multilabel text classification in which a document can belong to one or more than one category. In text categorization, the number of the involved features is usually huge, causing the curse of the dimensionality problem. Besides, a category can be a nonconvex region, which is a union of several overlapping or disjoint subregions. An automatic classification system, thus, may suffer from large memory requirements or poor performance. By incorporating fuzzy techniques, our proposed method can overcome these issues. A fuzzy relevance measure is adopted to transform high-dimensional documents to low-dimensional fuzzy relevance vectors to avoid the curse of dimensionality problem. A clustering technique is used to divide the relevance space into a collection of subregions which are then combined to make up individual categories. This allows complex and nonconvex regions to be created. A number of experiments are presented to show the effectiveness of the proposed method in both performance and speed.
  • Keywords
    fuzzy set theory; pattern clustering; text analysis; clustering technique; curse-of-dimensionality problem; fuzzy based method; fuzzy relevance clustering; fuzzy relevance measure; fuzzy relevance vectors; fuzzy techniques; multilabel text categorization; Equations; Principal component analysis; Testing; Text categorization; Training; Transforms; Vectors; Clustering; dimensionality reduction; fuzzy relevance; multilabel learning; text classification;
  • fLanguage
    English
  • Journal_Title
    Fuzzy Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6706
  • Type

    jour

  • DOI
    10.1109/TFUZZ.2013.2294355
  • Filename
    6679223