Title :
E-mail Spam Filtering Using Support Vector Machines with Selection of Kernel Function Parameters
Author :
Wei-Chih, Hsu ; Yu, Tsan-Ying
Author_Institution :
Dept. of Comput. & Commun., Nat. Kaohsiung First Univ. of Sci. & Technol., Kaohsiung, Taiwan
Abstract :
Support Vector Machines (SVM) is a powerful classification technique in data mining and has been successfully applied to many real-world applications. Parameter selection of SVM will affect classification performance much during training process. However, parameter selection of SVM is usually identified by experience or grid search (GS). GS is simple and easily implemented, but it is very time-consuming. In this study, Taguchi method is proposed for improving GS and used to optimize the SVMbased E-mail Spam Filtering model. It is easy to implement by orthogonal arrays without iteration. A real-world mail dataset is selected to demonstrate the effectiveness and feasibility of the method. The results show that the Taguchi method can find the effective model with high classification accuracy and good robustness.
Keywords :
e-mail filters; grid computing; operating system kernels; optimisation; pattern classification; support vector machines; unsolicited e-mail; SVM; Taguchi method; data mining; e-mail spam filtering; grid search; kernel function parameters; parameter selection; support vector machines; Data mining; Electronic mail; Filtering; Kernel; Optimization methods; Postal services; Robustness; Support vector machine classification; Support vector machines; Unsolicited electronic mail;
Conference_Titel :
Innovative Computing, Information and Control (ICICIC), 2009 Fourth International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4244-5543-0
DOI :
10.1109/ICICIC.2009.184