DocumentCode :
2270162
Title :
Feature selection for image spam classification
Author :
Liu, Qiao ; Zhang, Feng-li ; Qin, Zhi-guang ; Wang, Chao ; Chen, Shuang ; Ma, Qiu-ming
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
fYear :
2010
fDate :
28-30 July 2010
Firstpage :
294
Lastpage :
297
Abstract :
This paper considers the low-level feature modeling problem in image spam classification, in which most of the prevalent content based spam filters are shown to be inefficient because their OCR procedure are vulnerable to text obscuring attacks from spammers. We first built up a basic feature set through a low-level feature extraction process, and then proposed a stepwise regression method to determine the best subset automatically, which was controlled by a minimum description length criterion. Experimental results indicate that the proposed approach is very effective for the purpose of modeling spam images, and the selected feature set is applicable for practical anti-spam tasks, its performance is comparable to some other cutting-edge approaches.
Keywords :
feature extraction; image classification; unsolicited e-mail; feature extraction process; feature selection; image spam classification; spam filter; stepwise regression method; Accuracy; Computer science; Electronic mail; Feature extraction; Histograms; Image color analysis; Mathematical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications, Circuits and Systems (ICCCAS), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8224-5
Type :
conf
DOI :
10.1109/ICCCAS.2010.5581994
Filename :
5581994
Link To Document :
بازگشت