Title :
An Effective Algorithm for Improving the Performance of Naive Bayes for Text Classification
Author_Institution :
Higher Vocational Coll., Shanghai Univ. of Eng. Sci., Shanghai, China
Abstract :
The Naive Bayes algorithm is simple and effective for text classification. However, its performance is often imperfect because it does not model text well, because of inappropriate feature selection, and because of some inherent disadvantages of Naive Bayes itself. This paper proposes several modifications to Naive Bayes to improve its performance, and examines their effect on categorization and the conditions under which they apply. Finally, the paper applies the modified algorithm to spam filtering, a typical text classification task. Experiments were conducted with this method, and the results were compared with those of the original method.
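For orientation, the unmodified baseline that the abstract refers to can be sketched as a standard multinomial Naive Bayes classifier with add-one (Laplace) smoothing. This is a minimal illustration of the textbook method only, not the paper's modified algorithm, whose details the abstract does not give. The function names `train` and `classify`, the whitespace tokenizer, and the toy messages in `TOY_DOCS` are all assumptions made for this sketch.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus of (label, text) pairs, invented for illustration.
TOY_DOCS = [
    ("spam", "win cash prize now"),
    ("spam", "cheap cash offer"),
    ("ham", "meeting agenda for monday"),
    ("ham", "project report attached"),
]


def train(docs):
    """Count documents per class and word occurrences per class."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for label, text in docs:
        class_docs[label] += 1
        for word in text.lower().split():  # naive whitespace tokenizer (assumption)
            word_counts[label][word] += 1
            vocab.add(word)
    return class_docs, word_counts, vocab


def classify(text, model):
    """Return the class with the highest log posterior for the given text."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        # Log prior: fraction of training documents in this class.
        score = math.log(class_docs[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # ignore words never seen in training
            # Add-one smoothed word likelihood for this class.
            count = word_counts[label][word] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label
```

Calling `classify("cash prize offer", train(TOY_DOCS))` selects the class whose smoothed word frequencies best explain the message. Working in log space avoids numerical underflow when many small probabilities are multiplied.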
Keywords :
Bayes methods; learning (artificial intelligence); pattern classification; text analysis; feature selection; naive Bayes algorithm; spam filter categorization; text classification; Application software; Educational institutions; Electronic mail; Frequency; High performance computing; Information filtering; Information filters; Machine learning; Research and development; Text categorization; naive Bayes classification; word frequencies
Conference_Title :
Computer Research and Development, 2010 Second International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-0-7695-4043-6
DOI :
10.1109/ICCRD.2010.160