Title :
A classification method for spam e-mail by Self-Organizing Map and automatically defined groups
Author :
Ichimura, T. ; Hara, Akira ; Kurosawa, Yoshiaki
Author_Institution :
Hiroshima City Univ., Hiroshima
Abstract :
E-mail has become harder to use as a communication tool because the number of E-mails infected with viruses and/or recognized as spam keeps increasing. Some E-mail filtering software removes such problematic messages. However, filters can misjudge: an important E-mail may be classified as spam, and then we never receive it. In this paper, we propose a classification method for spam E-mail based on the results of SpamAssassin, an open source software package that identifies spam signatures. This method can learn patterns of spam and ham E-mails and recognize them correctly. First, the method divides E-mails into categories with a Self-Organizing Map (SOM); it then extracts correct judgement rules with Automatically Defined Groups (ADGs), even when the results from SpamAssassin are wrong. To verify the effectiveness of the proposed method, we examined approximately 3,000 E-mails.
Keywords :
public domain software; self-organising feature maps; unsolicited e-mail; automatically defined groups; classification method; communication tool; e-mail filter softwares; self-organizing map; software; spam e-mail; spam signatures; spamassassin; Cultural differences; Electronic mail; Information filtering; Information filters; Internet; Matched filters; Open source software; Pattern recognition; Unsolicited electronic mail; Visual databases;
Conference_Title :
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
978-1-4244-0990-7
Electronic_ISBN :
978-1-4244-0991-4
DOI :
10.1109/ICSMC.2007.4413626