DocumentCode :
2786572
Title :
Bayesian statistical analysis for spams
Author :
Begriche, Youcef ; Serhrouchni, Ahmed
Author_Institution :
Inst. Telecom, Telecom ParisTech, Paris, France
fYear :
2010
fDate :
10-14 Oct. 2010
Firstpage :
989
Lastpage :
992
Abstract :
This paper presents a Bayesian statistical analysis applied to the spam problem. In most anti-spam related research, generally it is assumed that the probability of a spam occurrence is equal to 0.5, which is in our opinion unrealistic. It is also assumed that in the spam message, words are considered as an independent family of words. This makes us look at how the posterior probability behaves when the a priori probability is different from 0.5 and derive the consequences of the assumption of independent words on the posterior probability. The first assumption pushes us to define a prior and find a posterior probability laws to enhance the spam detection and increase the reliability decision. This analysis differs from previous results, that used the Bayesian approach to the anti-spam issue, especially through refinement and enhancement of various probability laws.
Keywords :
Bayes methods; probability; security of data; unsolicited e-mail; Bayesian statistical analysis; a priori probability; antispam; posterior probability; probability law; reliability decision; spam detection; spam message; spam occurrence; Bayesian methods; Filtering; Niobium; Telecommunications; Training; Unsolicited electronic mail; Bayesian statistical model; Binomial law; Classification; Conditional density; Distribution attachment; Ham(H); Spam(S);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Local Computer Networks (LCN), 2010 IEEE 35th Conference on
Conference_Location :
Denver, CO
ISSN :
0742-1303
Print_ISBN :
978-1-4244-8387-7
Type :
conf
DOI :
10.1109/LCN.2010.5735846
Filename :
5735846
Link To Document :
بازگشت