Title :
Text and image based spam email classification using KNN, Naïve Bayes and Reverse DBSCAN algorithm
Author :
Harisinghaney, Anirudh ; Dixit, Abhishek ; Gupta, Swastik ; Arora, Abhishek
Author_Institution :
CSE/IT Dept., Jaypee Inst. of Inf. Technol., Noida, India
Abstract :
The Internet has changed the way people communicate, and communication has become increasingly concentrated on email. Emails, text messages and online messenger chats have become part and parcel of our lives. Of all these channels, email is the most prone to exploitation, so email providers employ algorithms to separate spam from ham. In this paper, our prime aim is to detect text-based as well as image-based spam emails. To achieve this objective, we applied three algorithms: KNN, Naïve Bayes and reverse DBSCAN. The email text is pre-processed before the algorithms are run so that they predict better. The paper uses the Enron corpus dataset of spam and ham emails. We compare the performance of all three algorithms on four measures: precision, sensitivity, specificity and accuracy. All three algorithms attain good accuracy, and the results compare them on the same data set.
Keywords :
Bayes methods; image classification; neural nets; text analysis; text detection; unsolicited e-mail; Enron corpus dataset; Internet; KNN algorithm; Naïve Bayes algorithm; email text pre-processing; image based spam email classification; online messenger chatting; reverse DBSCAN algorithm; text based spam email classification; text messages; CAPTCHAs; Classification algorithms; Computers; Electronic mail; Image resolution; Technological innovation; Viruses (medical); Ham; Image Spam; KNN; Naïve Bayes; Spam; reverse DBSCAN
Conference_Title :
Optimization, Reliability, and Information Technology (ICROIT), 2014 International Conference on
Conference_Location :
Faridabad
Print_ISBN :
978-1-4799-3958-9
DOI :
10.1109/ICROIT.2014.6798302