Title :
A hybrid Naive Bayes approach for information filtering
Author :
Chiong, Raymond ; Bee Theng, Lau
Author_Institution :
Sch. of Inf. Technol., Swinburne Univ. of Technol., Kuching
Abstract :
Naive Bayes has been widely used in the field of machine learning research for many years. While it is fast and easy to implement, its performance in comparison to other machine learning methods is not ideal. In this paper, we present a hybrid approach using naive Bayes for information filtering. This approach differs from previous approaches in that it uses Multivariate Bernoulli Model and Multinomial Model successively. We report on the performance of our proposed approach using Reuters-21578 and 20 Newsgroups data. In the filtering process, we first use multivariate Bernoulli model to estimate the pre- examined probability for words appear in a document. Subsequently, the Multinomial Model is used to estimate the post-examined probability for final classification. We show that with sufficient training data, this hybrid approach can achieve higher F-measure score than using multivariate Bernoulli model or multinomial model alone. It can even achieve competitive results as compared to the highly complex learning method such as support vector machine (SVM) with less computational time.
Keywords :
Bayes methods; information filtering; learning (artificial intelligence); hybrid naive Bayes approach; information filtering; machine learning; multinomial model; multivariate Bernoulli model; support vector machine; Electronic mail; Frequency estimation; Information filtering; Information filters; Information technology; Learning systems; Machine learning; Support vector machine classification; Support vector machines; Training data;
Conference_Titel :
Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-1717-9
Electronic_ISBN :
978-1-4244-1718-6
DOI :
10.1109/ICIEA.2008.4582666