Title :
A fast image spam filter based on ORB
Author :
Yixin Hou ; Bo Zhao ; Honggang Zhang ; Hanbing Yan
Author_Institution :
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Image spam has become a new obfuscating method to bypass conventional text based spam filters. In this paper, a new kind of image spam filtering method is proposed based on the characteristics of the spam being sent repeatedly and their contents being highly resemble to each other. We extract sub-block color histogram and ORB as image feature and run a scalable vocabulary tree to detect image spam. The system is tested on Mark Dredze´s dataset and our own Chinese image spam corpus. Experimental results demonstrate that the proposed method can achieve good accuracy while having a less than 0.02% false positive rate.
Keywords :
feature extraction; image colour analysis; information filtering; unsolicited e-mail; ORB; image feature; image spam detection; image spam filtering; scalable vocabulary tree; subblock color histogram; text based spam filter; Databases; Electronic mail; Feature extraction; Filtering; Histograms; Image color analysis; Vocabulary; ORB; Scalable vocabulary tree; Sub-block color histogram;
Conference_Titel :
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2201-0
DOI :
10.1109/ICNIDC.2012.6418804