• DocumentCode
    2430116
  • Title

    Applying cascaded feature selection to SVM text categorization

  • Author

    Masuyama, Takeshi ; Nakagawa, Hiroshi

  • Author_Institution
    Inf. Technol. Center, Tokyo Univ., Japan
  • fYear
    2002
  • fDate
    2-6 Sept. 2002
  • Firstpage
    241
  • Lastpage
    245
  • Abstract
    This paper investigates the effect of a cascaded feature selection (CFS) in SVM text categorization. Unlike existing feature selections, our method (CFS) has two advantages. One can make use of the characteristic of each feature (word). Another is that unnecessary test documents for a category, which should be categorized into a negative set, can be removed in the first step. Compared with the method which does not apply CFS, our method achieved significant good performance especially about the categories which contain a small number of training documents.
  • Keywords
    data mining; feature extraction; learning (artificial intelligence); text analysis; SVM text categorization; cascaded feature selection; test documents; training documents; Humans; Information technology; Organizing; Quality management; Search engines; Support vector machine classification; Support vector machines; Testing; Text categorization; Web sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2002. Proceedings. 13th International Workshop on
  • ISSN
    1529-4188
  • Print_ISBN
    0-7695-1668-8
  • Type

    conf

  • DOI
    10.1109/DEXA.2002.1045905
  • Filename
    1045905