• DocumentCode
    515425
  • Title

    Naive Bayes Classifier based Arabic document categorization

  • Author

    Noaman, Hatem M. ; Elmougy, Samir ; Ghoneim, Ahmed ; Hamza, Taher

  • Author_Institution
    Fac. of Comput. & Inf. Sci., Mansoura Univ., Mansoura, Egypt
  • fYear
    2010
  • fDate
    28-30 March 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Text Categorization aims to assign an electronic document to one or more categories based on its contents. Due to the rapid growth of the number of online Arabic documents, the information libraries and Arabic document corpus, automatic Arabic document classification becomes an important task. This paper suggests the use of rooting algorithm with Nai¿ve Bayes Classifier to the problem of document categorization of Arabic language and reports the algorithm performance in terms of error rate, accuracy, and micro-average recall measures. Our experimental study shows that using rooting algorithm with Nai¿ve Bayes (NB) Classifier gives ~62.23% average accuracy and decreases the dimensionality of the training documents.
  • Keywords
    belief networks; natural language processing; text analysis; Arabic document categorization; electronic document; naive Bayes classifier; rooting algorithm; text categorization; Classification tree analysis; Inference algorithms; Information filtering; Information filters; Information retrieval; Machine learning; Machine learning algorithms; Niobium; Support vector machines; Text categorization; Naïve Bayes classifier; document categorization; machine learning; natural language processing for Arabic language;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics and Systems (INFOS), 2010 The 7th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-5828-8
  • Type

    conf

  • Filename
    5461819