Title :
A novel spam image filtering framework with Multi-Label Classification
Author :
Cheng, Hongrong ; Qin, Zhiguang ; Fu, Chong ; Wang, Yong
Author_Institution :
Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
Abstract :
Gray images, which could reasonably be considered as either spam or ham by different recipients, present significant obstacles to conventional binary spam filtering systems. The inconsistent labels of gray images will inevitably deteriorate the overall filter performance. In this paper, we present a novel framework named BFMLC (Binary Filtering with Multi-Label Classification) to take both spam image filtering and user preferences into account. The BFMLC framework comprises two-stage classification tasks: the filter-oriented binary classification and user-oriented multi-label classification. A filter based on the BFMLC framework can not only discriminate spam images from ham images, but also classify spam image as several predefined topics. According to user preference settings on the client side, the specific spam images (gray images) are delivered to individuals. Moreover, the BFMLC framework can be generalized to deal with text, image or mixed emails. We implement a spam image filtering system based on the BFMLC framework and conduct experiments in public personal datasets. The experimental results show that the system can identify spam images with the average accuracy of 96.309% and classify spam images as predefined topics with the average precision of 89.42%.
Keywords :
filtering theory; image classification; unsolicited e-mail; BFMLC framework; binary spam filtering systems; gray images; multilabel classification; spam image classification; spam image filtering framework; user oriented multilabel classification; Classification tree analysis; Information filters; Principal component analysis; Unsolicited electronic mail;
Conference_Titel :
Communications, Circuits and Systems (ICCCAS), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8224-5
DOI :
10.1109/ICCCAS.2010.5582001