• DocumentCode
    1592276
  • Title

    Adaptive filtering of spam

  • Author

    Pelletier, L. ; Almhana, J. ; Choulakian, V.

  • Author_Institution
    GRETI, Moncton Univ., NB, Canada
  • fYear
    2004
  • Firstpage
    218
  • Lastpage
    224
  • Abstract
    We present a new spam filter which acts as an additional layer in the spam filtering process. This filter is based on what we call a representative vocabulary. Spam e-mails are divided into categories in which each category is represented by a set of tokens which form a representative text (RT). Tokens are strings of characters (words, sentences, or sometimes meaningless strings of characters). This RT is used to compute a resemblance ratio with incoming e-mails. With this ratio, we decide whether the incoming e-mail is a spam. This filter was implemented and integrated to Spamihilator software. Some experimental and interesting results are presented.
  • Keywords
    text analysis; unsolicited e-mail; vocabulary; Spamihilator software; adaptive filtering; adaptive spam filtering; character strings; representative text; representative vocabulary; resemblance ratio; unsolicited e-mail; Adaptive filters; Bandwidth; Bayesian methods; Costs; Electronic mail; Information filtering; Information filters; Unsolicited electronic mail; Vocabulary; Web and internet services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Networks and Services Research, 2004. Proceedings. Second Annual Conference on
  • Print_ISBN
    0-7695-2096-0
  • Type

    conf

  • DOI
    10.1109/DNSR.2004.1344731
  • Filename
    1344731