DocumentCode :
2010788
Title :
Multiple Feature-Classifier Combination in Automated Text Classification
Author :
Busagala, Lazaro S P ; Ohyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka
Author_Institution :
Sokoine Nat. Agric. Libr., Sokoine Univ. of Agric., Morogoro, Tanzania
fYear :
2012
fDate :
27-29 March 2012
Firstpage :
43
Lastpage :
47
Abstract :
Automatic text classification (ATC) is important in applications such as indexing and organizing electronic documents in databases leading to enhancement of information access and retrieval. We propose a method which employs various types of feature sets and learning algorithms to improve classification effectiveness. Unlike the conventional methods of multi-classifier combination, the proposed method considers the contributions of various types of feature sets and classifiers. It can therefore be known as multiple feature-classifier combination (MFC) method. In this paper we present empirical evaluation of MFC using two benchmarks of text collections to determine its effectiveness. Empirical evaluation show that MFC consistently outperformed all compared methods.
Keywords :
learning (artificial intelligence); pattern classification; text analysis; automated text classification; classification effectiveness; electronic document; feature set; information access; information retrieval; learning algorithm; multiple feature-classifier combination; text collection; Information retrieval; Machine learning; Principal component analysis; Support vector machines; Text categorization; Training; Vectors; Feature reduction; Feature-Classifier Combination; Multi-classifier combination; Text Classification/Categorization; ensembles;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
Conference_Location :
Gold Cost, QLD
Print_ISBN :
978-1-4673-0868-7
Type :
conf
DOI :
10.1109/DAS.2012.56
Filename :
6195332
Link To Document :
بازگشت