Title :
An Evaluation of Naïve Bayesian Anti-Spam Filtering Techniques
Author :
Deshpande, Vikas P. ; Erbacher, Robert F. ; Harris, Chris
Author_Institution :
Utah State Univ., Logan
Abstract :
An efficient anti-spam filter that would block all spam, without blocking any legitimate messages is a growing need. To address this problem, we examine the effectiveness of statistically-based approaches Naive Bayesian anti-spam filters, as it is content-based and self-learning (adaptive) in nature. Additionally, we designed a derivative filter based on relative numbers of tokens. We train the filters using a large corpus of legitimate messages and spam and we test the filter using new incoming personal messages. More specifically, four filtering techniques available for a Naive Bayesian filter are evaluated. We look at the effectiveness of the technique, and we evaluate different threshold values in order to find an optimal anti-spam filter configuration. Based on cost-sensitive measures, we conclude that additional safety precautions are needed for a Bayesian anti-spam filter to be put into practice. However, our technique can make a positive contribution as a first pass filter.
Keywords :
Bayes methods; pattern classification; unsolicited e-mail; Naive Bayesian anti-spam filtering techniques; derivative filter; legitimate messages; Adaptive filters; Bayesian methods; Computer science; Conferences; Information filtering; Information filters; Safety; Statistics; Testing; Text categorization; Evaluation; Naïve Bayesian; Spam filter;
Conference_Titel :
Information Assurance and Security Workshop, 2007. IAW '07. IEEE SMC
Conference_Location :
West Point, NY
Print_ISBN :
1-4244-1304-4
Electronic_ISBN :
1-4244-1304-4
DOI :
10.1109/IAW.2007.381951