DocumentCode :
1646092
Title :
Using a probable weight based Bayesian approach for spam filtering
Author :
Anaya, Saqib ; Ali, Arshad ; Ahmad, H. Farooq
Author_Institution :
Inst. of Inf. Technol., Nat. Univ. of Sci. & Technol., Rawalpindi, Pakistan
fYear :
2004
Firstpage :
340
Lastpage :
345
Abstract :
In today's digital world, the Internet has become an integral part of our lives. It is difficult to imagine life without the Internet, or without email for that matter. The Internet and email have put huge amounts of information at everyone's disposal. The advent of powerful search engines such as Google® has revolutionized searching. Despite all this development, whenever we need something we are typically presented with hundreds, if not thousands, of potential answers. Things seem to have gone out of control. In such a scenario, other sources of information such as online journals, emails, online newspapers and online equivalents of just about everything on paper do not help the cause. Information has become so abundant that we have difficulty extracting the right amount required for decision-making. This problem has been dubbed information overload, or too much information. This paper aims to address one aspect of this problem, namely intelligently filtering out spam. The intelligent spam filter derives its intelligence from mathematical weights assigned to the individual words appearing in each email, which are combined using Bayes' rule. The algorithm has achieved an average accuracy of 93 percent.
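The abstract does not specify how the word weights are computed or combined, so the following is only a minimal sketch of the general idea of per-word weights combined with Bayes' rule, not the authors' method. The function names `train` and `spam_probability`, the add-one smoothing, the whitespace tokenization, and the assumptions of equal class priors and independent words are all illustrative choices that do not come from the paper.

```python
import math
from collections import Counter


def train(spam_docs, ham_docs):
    """Estimate a per-word weight P(spam | word), assuming equal class priors.

    Hypothetical helper: add-one smoothing keeps every weight strictly
    between 0 and 1 so that logarithms are always defined.
    """
    # Count in how many documents of each class a word occurs.
    spam_counts = Counter(w for d in spam_docs for w in set(d.lower().split()))
    ham_counts = Counter(w for d in ham_docs for w in set(d.lower().split()))
    n_spam, n_ham = len(spam_docs), len(ham_docs)

    weights = {}
    for word in set(spam_counts) | set(ham_counts):
        p_word_given_spam = (spam_counts[word] + 1) / (n_spam + 2)
        p_word_given_ham = (ham_counts[word] + 1) / (n_ham + 2)
        weights[word] = p_word_given_spam / (p_word_given_spam + p_word_given_ham)
    return weights


def spam_probability(message, weights):
    """Combine the weights of the known words in a message using Bayes' rule.

    Assumes the words are conditionally independent (the naive Bayes
    assumption). Words never seen in training are ignored.
    """
    log_spam = 0.0
    log_ham = 0.0
    for word in set(message.lower().split()):
        p = weights.get(word)
        if p is None:
            continue
        log_spam += math.log(p)
        log_ham += math.log(1.0 - p)
    # P(spam) = prod(p) / (prod(p) + prod(1 - p)), evaluated in log space.
    # The difference is clamped so math.exp cannot overflow on long messages.
    diff = max(min(log_ham - log_spam, 700.0), -700.0)
    return 1.0 / (1.0 + math.exp(diff))


if __name__ == "__main__":
    weights = train(
        ["buy cheap pills now", "cheap pills offer"],
        ["meeting agenda for monday", "project meeting notes"],
    )
    print(spam_probability("cheap pills", weights))
    print(spam_probability("meeting notes", weights))
```

A message would then be flagged as spam when the returned probability exceeds a chosen threshold, for example 0.5. The threshold, like the toy training data above, is an assumption for illustration only.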
Keywords :
Bayes methods; Internet; information filtering; search engines; unsolicited e-mail; email; information overload; intelligent spam filtering; probable weight based Bayesian; Bayesian methods; Data mining; Decision making; Electronic mail; Information filters; Information resources; Postal services
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Multitopic Conference, 2004. Proceedings of INMIC 2004. 8th International
Print_ISBN :
0-7803-8680-9
Type :
conf
DOI :
10.1109/INMIC.2004.1492900
Filename :
1492900