• DocumentCode
    2539678
  • Title

    A classification method for spam e-mail by Self-Organizing Map and automatically defined groups

  • Author

    Ichimura, T. ; Hara, Akira ; Kurosawa, Yoshiaki

  • Author_Institution
    Hiroshima City Univ., Hiroshima
  • fYear
    2007
  • fDate
    7-10 Oct. 2007
  • Firstpage
    2044
  • Lastpage
    2049
  • Abstract
    E-mail is becoming harder to rely on as a communication tool because the number of messages infected with viruses and/or recognized as spam keeps increasing. Some E-mail filtering software removes such problematic messages. However, a filter may misjudge an important E-mail as spam, in which case we never receive it. In this paper, we propose a classification method for spam E-mail based on the results of SpamAssassin, an open source software package that identifies spam signatures. The method can learn the patterns of spam and ham E-mails and correctly recognize them. First, the method divides E-mails into several categories with a Self-Organizing Map (SOM); it then extracts correct judgement rules with Automatically Defined Groups (ADGs), even when the results from SpamAssassin are wrong. To verify the effectiveness of the proposed method, we examined approximately 3,000 E-mails.
  • Keywords
    public domain software; self-organising feature maps; unsolicited e-mail; automatically defined groups; classification method; communication tool; e-mail filter softwares; self-organizing map; software; spam e-mail; spam signatures; spamassassin; Cultural differences; Electronic mail; Information filtering; Information filters; Internet; Matched filters; Open source software; Pattern recognition; Unsolicited electronic mail; Visual databases;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    978-1-4244-0990-7
  • Electronic_ISBN
    978-1-4244-0991-4
  • Type

    conf

  • DOI
    10.1109/ICSMC.2007.4413626
  • Filename
    4413626