DocumentCode :
2485563
Title :
Automatic Personalized Spam Filtering through Significant Word Modeling
Author :
Junejo, Khurum Nazir ; Karim, Asim
Author_Institution :
Lahore Univ. of Manage. Sci., Lahore
Volume :
2
fYear :
2007
fDate :
29-31 Oct. 2007
Firstpage :
291
Lastpage :
298
Abstract :
Typically, spam filters are built on the assumption that the characteristics of e-mails in the training set is identical to those in individual users´ inboxes on which it will be applied. This assumption is oftentimes incorrect leading to poor performance of the filter. A personalized spam filter is built by taking into account the characteristics of e-mails in individual users´ inboxes. We present an automatic approach for personalized spam filtering that does not require users´ feedback. The proposed algorithm builds a statistical model of significant spam and non-spam words from the labeled training set and then updates it in multiple passes over the unlabeled individual user´s inbox. The personalization of the model leads to improved filtering performance. We evaluate our algorithm on two publicly available datasets. The results show that our algorithm is robust and scalable, and a viable solution to the server-side personalized spam filtering problem. Moreover, it outperforms published results on one dataset and its performance is equivalent to the others on the second dataset.
Keywords :
information filtering; statistical analysis; unsolicited e-mail; automatic personalized spam filtering; available datasets; e-mails; labeled training set; statistical model; training set; word modeling; Artificial intelligence; Computer science; Conference management; Electronic mail; Feedback; Filtering algorithms; Filters; Management training; Robustness; Unsolicited electronic mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location :
Patras
ISSN :
1082-3409
Print_ISBN :
978-0-7695-3015-4
Type :
conf
DOI :
10.1109/ICTAI.2007.66
Filename :
4410394
Link To Document :
بازگشت