DocumentCode
2010788
Title
Multiple Feature-Classifier Combination in Automated Text Classification
Author
Busagala, Lazaro S P ; Ohyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka
Author_Institution
Sokoine Nat. Agric. Libr., Sokoine Univ. of Agric., Morogoro, Tanzania
fYear
2012
fDate
27-29 March 2012
Firstpage
43
Lastpage
47
Abstract
Automatic text classification (ATC) is important in applications such as indexing and organizing electronic documents in databases leading to enhancement of information access and retrieval. We propose a method which employs various types of feature sets and learning algorithms to improve classification effectiveness. Unlike the conventional methods of multi-classifier combination, the proposed method considers the contributions of various types of feature sets and classifiers. It can therefore be known as multiple feature-classifier combination (MFC) method. In this paper we present empirical evaluation of MFC using two benchmarks of text collections to determine its effectiveness. Empirical evaluation show that MFC consistently outperformed all compared methods.
Keywords
learning (artificial intelligence); pattern classification; text analysis; automated text classification; classification effectiveness; electronic document; feature set; information access; information retrieval; learning algorithm; multiple feature-classifier combination; text collection; Information retrieval; Machine learning; Principal component analysis; Support vector machines; Text categorization; Training; Vectors; Feature reduction; Feature-Classifier Combination; Multi-classifier combination; Text Classification/Categorization; ensembles;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location
Gold Cost, QLD
Print_ISBN
978-1-4673-0868-7
Type
conf
DOI
10.1109/DAS.2012.56
Filename
6195332
Link To Document