DocumentCode
1990539
Title
A New Method of Text Categorization on Imbalanced Datasets
Author
Xin-fu, Li ; Yan, Yu ; Peng, Yin
Author_Institution
Coll. of Math. & Comput. Sci., Hebei Univ., Baoding
Volume
2
fYear
2008
fDate
21-22 Dec. 2008
Firstpage
259
Lastpage
262
Abstract
This paper aims to improve categorization performance on the classes with few samples in imbalanced datasets, approaching the problem at the data level through re-sampling. The main idea is to balance the number of texts across classes by adding texts to the under-represented ones. The experiment indicates that the system effectively improves the accuracy of text categorization.
Keywords
data handling; sampling methods; text analysis; data resampling; imbalanced datasets; text categorization; Computer science education; Educational institutions; Educational technology; Machine learning; Mathematics; Pattern recognition; Support vector machine classification; Support vector machines; Testing; SVM
fLanguage
English
Publisher
IEEE
Conference_Titel
Education Technology and Training, 2008. and 2008 International Workshop on Geoscience and Remote Sensing. ETT and GRS 2008. International Workshop on
Conference_Location
Shanghai
Print_ISBN
978-0-7695-3563-0
Type
conf
DOI
10.1109/ETTandGRS.2008.42
Filename
5070355