• DocumentCode
    1686760
  • Title

    Controlling spam Emails at the routers

  • Author

    Agrawal, Banit ; Kumar, Nitin ; Molle, Mart

  • Author_Institution
    Dept. of Comput. Sci. & Eng., California Univ., Riverside, CA, USA
  • Volume
    3
  • fYear
    2005
  • Firstpage
    1588
  • Abstract
    Like it or not, unsolicited bulk commercial Email (aka "spam") has become a regular menu item on the Internet information diet. Every day, millions of people find their Email in-boxes clogged with vast quantities of spam. Moreover, the daily replenishment of all those in-boxes with new spam also consumes significant amount of network bandwidth. Dealing with spam is like fighting a battle against a large army; the most effective approach is to employ multiple tactics. However, almost all spam control methods that have been proposed and implemented follow the same basic theme of establishing a "front line" of defense at the end-user level. Thus, in this paper we propose a method for blocking the supply lines. More specifically, we identify spam at the router level and control it via rate limiting. Spam identification is done in two phases. In the first phase, we identify the bulk stream of Email messages and in second phase we apply Bayesian classifier to identify whether it is a spam. If a bulk Email stream is classified as a spam then we rate limit it (e.g. no more than one copy per minute). Our proposed method exploits the short timespan delivery and bulkiness of spam Emails. We use publicly available spam corpus to evaluate our proposed scheme and in the other set of experiments, we work on one month sanitized log of our department Emails to provide the representative results.
  • Keywords
    Internet; belief networks; pattern classification; telecommunication congestion control; telecommunication network routing; unsolicited e-mail; Bayesian classifier; Internet information; bandwidth consumption; end-user level; front line defense; network router; short timespan delivery; spam Email control; spam identification; unsolicited bulk commercial Email; Bandwidth; Bayesian methods; Communication system traffic control; Computer science; Electronic mail; Internet; Network servers; Probes; Unsolicited electronic mail; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, 2005. ICC 2005. 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-8938-7
  • Type

    conf

  • DOI
    10.1109/ICC.2005.1494611
  • Filename
    1494611