• DocumentCode
    1938225
  • Title

    Incorporting Keyword-Based Filtering to Document Classification for Email Spamming

  • Author

    Wong, Tak-Lam ; Chow, Kai-On ; Wong, Franz

  • Author_Institution
    City Univ. of Hong Kong, Kowloon
  • Volume
    7
  • fYear
    2007
  • fDate
    19-22 Aug. 2007
  • Firstpage
    3899
  • Lastpage
    3904
  • Abstract
    Email spamming causes serious problems in the Internet resulting in a huge waste of resources and attracting high attention from research society. Automatic document classification and keyword-based filtering are two kinds of techniques which have been applied to filter spam emails to achieve satisfactory results. This paper proposes a formal method by incorporating keyword-based filtering to document classification. To consider the potentially high cost of misclassification of an email to a spam email in real-word situation, a cost-sensitive evaluation metric is adopted to evaluate our approaches. We conducted extensive experiments in real-word data showing promising results.
  • Keywords
    classification; document handling; information filtering; unsolicited e-mail; document classification; email spamming; keyword-based filtering; Bandwidth; Computer science; Costs; Electronic mail; Information filtering; Information filters; Internet; Machine learning; Uncertainty; Watches; Cost-sensitive evaluation; Documents classification; Email filtering; Keyword-based filtering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2007 International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-0973-0
  • Electronic_ISBN
    978-1-4244-0973-0
  • Type

    conf

  • DOI
    10.1109/ICMLC.2007.4370827
  • Filename
    4370827