Title :
Spam image discrimination using support vector machine based on higher-order local autocorrelation feature extraction
Author :
Cheng, Hongrong ; Qin, Zhiguang ; Liu, Qiao ; Wan, Mingcheng
Author_Institution :
Sch. of Comput. Sci.&Eng, Univ. of Electron. Sci. & Technol. of China, Chengdu
Abstract :
In this paper, a new method is proposed for discriminating spam images from non-spam images. This method extracts edge features of a binarized image by using higher-order local autocorrelation(HLAC), and then input those features to support vector machine (SVM) for classification. Our method has three unique characteristics. First, the method extracts edge features which can represent major edge properties of an image without limitations imposed by image edgespsila directions or distributions. Second, the method can tolerate effectively slight changes of color, texture, size, layout of an image, and characteristics of text embedded in it. Third, the method is fast because of no time cost of text location and recognition. Experimental results for the public personal dataset show that the proposed method can separate spam images from non-spam images with minimum recognition rates of 98%.
Keywords :
correlation methods; feature extraction; image texture; support vector machines; HLAC; higher-order local autocorrelation feature extraction; image texture; spam image discrimination; support vector machines; text location; text recognition; Autocorrelation; Costs; Feature extraction; Filters; Image recognition; Optical character recognition software; Support vector machine classification; Support vector machines; Text recognition; Unsolicited electronic mail;
Conference_Titel :
Cybernetics and Intelligent Systems, 2008 IEEE Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-1673-8
Electronic_ISBN :
978-1-4244-1674-5
DOI :
10.1109/ICCIS.2008.4670821