DocumentCode
2259551
Title
Automatic text classification using modified centroid classifier
Author
Elmarhumy, Mahmoud ; Fattah, M.A. ; Ren, Fuji
Author_Institution
Fac. of Eng., Univ. of Tokushima, Tokushima, Japan
fYear
2009
fDate
24-27 Sept. 2009
Firstpage
1
Lastpage
4
Abstract
This work proposes an approach to address the problem of inductive bias or model misfit incurred by the centroid classifier assumption to enhance the automatic text classification task. This approach is a trainable classifier, which takes into account tfidf as a text feature. The main idea of the proposed approach is to take advantage of the most similar training errors to the classification model to successively update it based on a certain threshold. The proposed approach is simple to implement and flexible. The proposed approach performance is measured at several threshold values on the Reuters -21578 text categorization test collection. The experimental results show that the proposed approach can improve the performance of centroid classifier.
Keywords
data mining; pattern classification; text analysis; automatic text classification; data mining; modified centroid classifier; text categorization; text feature; Bayesian methods; Classification tree analysis; Data mining; Error correction; Internet; Machine learning; Organizing; Supervised learning; Testing; Text categorization; Data mining; Text classification; centroid classifier; text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location
Dalian
Print_ISBN
978-1-4244-4538-7
Electronic_ISBN
978-1-4244-4540-0
Type
conf
DOI
10.1109/NLPKE.2009.5313757
Filename
5313757
Link To Document