• DocumentCode
    1990539
  • Title

    A New Method of Text Categorization on Imbalanced Datasets

  • Author

    Xin-fu, Li ; Yan, Yu ; Peng, Yin

  • Author_Institution
    Coll. of Math. & Comput. Sci., Hebei Univ., Baoding
  • Volume
    2
  • fYear
    2008
  • fDate
    21-22 Dec. 2008
  • Firstpage
    259
  • Lastpage
    262
  • Abstract
    This paper aims at improving the categorization performance of the small number of samples in the imbalance datasets, and dealing with data re-sampling from the perspective of data. The main idea is to make the number of various types of texts by increasing some texts. The experiment indicates that the system has improved the accuracy of text-categorization effectively.
  • Keywords
    data handling; sampling methods; text analysis; data resampling; imbalanced datasets; text categorization; Computer science education; Educational institutions; Educational technology; Machine learning; Mathematics; Pattern recognition; Support vector machine classification; Support vector machines; Testing; Text categorization; SVM; imbalanced dataset; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3563-0
  • Type

    conf

  • DOI
    10.1109/ETTandGRS.2008.42
  • Filename
    5070355