• DocumentCode
    2010788
  • Title

    Multiple Feature-Classifier Combination in Automated Text Classification

  • Author

    Busagala, Lazaro S P ; Ohyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka

  • Author_Institution
    Sokoine Nat. Agric. Libr., Sokoine Univ. of Agric., Morogoro, Tanzania
  • fYear
    2012
  • fDate
    27-29 March 2012
  • Firstpage
    43
  • Lastpage
    47
  • Abstract
    Automatic text classification (ATC) is important in applications such as indexing and organizing electronic documents in databases leading to enhancement of information access and retrieval. We propose a method which employs various types of feature sets and learning algorithms to improve classification effectiveness. Unlike the conventional methods of multi-classifier combination, the proposed method considers the contributions of various types of feature sets and classifiers. It can therefore be known as multiple feature-classifier combination (MFC) method. In this paper we present empirical evaluation of MFC using two benchmarks of text collections to determine its effectiveness. Empirical evaluation show that MFC consistently outperformed all compared methods.
  • Keywords
    learning (artificial intelligence); pattern classification; text analysis; automated text classification; classification effectiveness; electronic document; feature set; information access; information retrieval; learning algorithm; multiple feature-classifier combination; text collection; Information retrieval; Machine learning; Principal component analysis; Support vector machines; Text categorization; Training; Vectors; Feature reduction; Feature-Classifier Combination; Multi-classifier combination; Text Classification/Categorization; ensembles;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2012 10th IAPR International Workshop on
  • Conference_Location
    Gold Cost, QLD
  • Print_ISBN
    978-1-4673-0868-7
  • Type

    conf

  • DOI
    10.1109/DAS.2012.56
  • Filename
    6195332