• DocumentCode
    2343684
  • Title

    A hybrid Naive Bayes approach for information filtering

  • Author

    Chiong, Raymond ; Bee Theng, Lau

  • Author_Institution
    Sch. of Inf. Technol., Swinburne Univ. of Technol., Kuching
  • fYear
    2008
  • fDate
    3-5 June 2008
  • Firstpage
    1003
  • Lastpage
    1007
  • Abstract
    Naive Bayes has been widely used in the field of machine learning research for many years. While it is fast and easy to implement, its performance in comparison to other machine learning methods is not ideal. In this paper, we present a hybrid approach using naive Bayes for information filtering. This approach differs from previous approaches in that it uses Multivariate Bernoulli Model and Multinomial Model successively. We report on the performance of our proposed approach using Reuters-21578 and 20 Newsgroups data. In the filtering process, we first use multivariate Bernoulli model to estimate the pre- examined probability for words appear in a document. Subsequently, the Multinomial Model is used to estimate the post-examined probability for final classification. We show that with sufficient training data, this hybrid approach can achieve higher F-measure score than using multivariate Bernoulli model or multinomial model alone. It can even achieve competitive results as compared to the highly complex learning method such as support vector machine (SVM) with less computational time.
  • Keywords
    Bayes methods; information filtering; learning (artificial intelligence); hybrid naive Bayes approach; information filtering; machine learning; multinomial model; multivariate Bernoulli model; support vector machine; Electronic mail; Frequency estimation; Information filtering; Information filters; Information technology; Learning systems; Machine learning; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-1717-9
  • Electronic_ISBN
    978-1-4244-1718-6
  • Type

    conf

  • DOI
    10.1109/ICIEA.2008.4582666
  • Filename
    4582666