Title :
Web Service-Enabled Spam Filtering with Naïve Bayes Classification
Author :
Wanqing You ; Kai Qian ; Dan Lo ; Bhattacharya, Prabir ; Minzhe Guo ; Ying Qian
Author_Institution :
Dept. of Comput. Sci., Southern Polytech. State Univ., Marietta, GA, USA
fDate :
March 30 2015-April 2 2015
Abstract :
Electronic mail has nowadays become a convenient and inexpensive way for communication regardless of the distance. However, an increasing volume of unsolicited emails is bringing down the productivity dramatically. There is a need for reliable anti-spam filters to separate such messages from legitimate ones. The Naïve Bayesian classifier is suggested as an effective engine to pick out spam emails. We have developed an anti-spam filter that employs this content-based classifier. This statistic-based classifier was trained on Enron Spam Dataset, a well-known spam/legitimate email dataset. We developed this filter as a Web Service, which would consume the emails user uploads and give back the predicted probability that in what degree the given email is spam. This engine was achieved by Rest easy technology, and consists three phases to train pre-labeled emails and then apply Naïve Bays theorem to calculate email´s Spamicity.
Keywords :
Bayes methods; Web services; pattern classification; statistical analysis; unsolicited e-mail; Enron Spam dataset; Web service-enabled spam filtering; content-based classifier; electronic mail; email spamicity; legitimate email dataset; naïve Bayes classification; rest easy technology; spam email dataset; spam emails; statistic-based classifier; unsolicited emails; Bayes methods; Filtering; Training; Unsolicited electronic mail; Web services; Big data; Naïve Bayesian classifier; Spam filter; Web Services;
Conference_Titel :
Big Data Computing Service and Applications (BigDataService), 2015 IEEE First International Conference on
Conference_Location :
Redwood City, CA
DOI :
10.1109/BigDataService.2015.19