Title :
Feature selection for image spam classification
Author :
Liu, Qiao ; Zhang, Feng-li ; Qin, Zhi-guang ; Wang, Chao ; Chen, Shuang ; Ma, Qiu-ming
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
This paper considers the low-level feature modeling problem in image spam classification, in which most of the prevalent content based spam filters are shown to be inefficient because their OCR procedure are vulnerable to text obscuring attacks from spammers. We first built up a basic feature set through a low-level feature extraction process, and then proposed a stepwise regression method to determine the best subset automatically, which was controlled by a minimum description length criterion. Experimental results indicate that the proposed approach is very effective for the purpose of modeling spam images, and the selected feature set is applicable for practical anti-spam tasks, its performance is comparable to some other cutting-edge approaches.
Keywords :
feature extraction; image classification; unsolicited e-mail; feature extraction process; feature selection; image spam classification; spam filter; stepwise regression method; Accuracy; Computer science; Electronic mail; Feature extraction; Histograms; Image color analysis; Mathematical model;
Conference_Titel :
Communications, Circuits and Systems (ICCCAS), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8224-5
DOI :
10.1109/ICCCAS.2010.5581994