• DocumentCode
    2425430
  • Title

    Combination Methodologies of Text Classifier: Design and Implementation

  • Author

    Bai Rujiang ; Wang Xiaoyue

  • Author_Institution
    Shandong Univ. of Technol. Libr., Zibo
  • Volume
    4
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    21
  • Lastpage
    25
  • Abstract
    Support vector machines, one of the most population techniques for classification, have been widely used in many application areas. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy .The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed. We present rough set method for feature reduce and a genetic algorithm approach for feature selection and parameters optimization to solve this kind of problem. We tried Reuters 21578 using the proposed method. Experimental results indicate, compared with the traditional methods, our proposed method significantly improves the classification accuracy and has fewer input features for support vector machines.
  • Keywords
    classification; genetic algorithms; rough set theory; support vector machines; text analysis; Reuters 21578; classification accuracy; feature selection; feature vectors; genetic algorithm; rough set method; support vector machines; text classifier; Genetic algorithms; Instruments; Kernel; Libraries; Optimization methods; Organizing; Rough sets; Support vector machine classification; Support vector machines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.222
  • Filename
    4406346