DocumentCode
1592276
Title
Adaptive filtering of spam
Author
Pelletier, L.; Almhana, J.; Choulakian, V.
Author_Institution
GRETI, Moncton Univ., NB, Canada
fYear
2004
Firstpage
218
Lastpage
224
Abstract
We present a new spam filter that acts as an additional layer in the spam filtering process. The filter is based on what we call a representative vocabulary. Spam e-mails are divided into categories, and each category is represented by a set of tokens that form a representative text (RT). Tokens are strings of characters (words, sentences, or sometimes meaningless strings of characters). The RT is used to compute a resemblance ratio with each incoming e-mail, and this ratio is used to decide whether the incoming e-mail is spam. The filter was implemented and integrated into the Spamihilator software. Experimental results are presented.
Keywords
text analysis; unsolicited e-mail; vocabulary; Spamihilator software; adaptive filtering; adaptive spam filtering; character strings; representative text; representative vocabulary; resemblance ratio; Adaptive filters; Bandwidth; Bayesian methods; Costs; Electronic mail; Information filtering; Information filters; Unsolicited electronic mail; Vocabulary; Web and internet services
fLanguage
English
Publisher
IEEE
Conference_Titel
Communication Networks and Services Research, 2004. Proceedings. Second Annual Conference on
Print_ISBN
0-7695-2096-0
Type
conf
DOI
10.1109/DNSR.2004.1344731
Filename
1344731