• DocumentCode
    2270350
  • Title

    A novel spam image filtering framework with Multi-Label Classification

  • Author

    Cheng, Hongrong ; Qin, Zhiguang ; Fu, Chong ; Wang, Yong

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Univ. of Electron. Sci. & Technol. of China, Chengdu, China
  • fYear
    2010
  • fDate
    28-30 July 2010
  • Firstpage
    282
  • Lastpage
    285
  • Abstract
    Gray images, which could reasonably be considered as either spam or ham by different recipients, present significant obstacles to conventional binary spam filtering systems. The inconsistent labels of gray images will inevitably deteriorate the overall filter performance. In this paper, we present a novel framework named BFMLC (Binary Filtering with Multi-Label Classification) to take both spam image filtering and user preferences into account. The BFMLC framework comprises two-stage classification tasks: the filter-oriented binary classification and user-oriented multi-label classification. A filter based on the BFMLC framework can not only discriminate spam images from ham images, but also classify spam image as several predefined topics. According to user preference settings on the client side, the specific spam images (gray images) are delivered to individuals. Moreover, the BFMLC framework can be generalized to deal with text, image or mixed emails. We implement a spam image filtering system based on the BFMLC framework and conduct experiments in public personal datasets. The experimental results show that the system can identify spam images with the average accuracy of 96.309% and classify spam images as predefined topics with the average precision of 89.42%.
  • Keywords
    filtering theory; image classification; unsolicited e-mail; BFMLC framework; binary spam filtering systems; gray images; multilabel classification; spam image classification; spam image filtering framework; user oriented multilabel classification; Classification tree analysis; Information filters; Principal component analysis; Unsolicited electronic mail;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Circuits and Systems (ICCCAS), 2010 International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-8224-5
  • Type

    conf

  • DOI
    10.1109/ICCCAS.2010.5582001
  • Filename
    5582001