DocumentCode
1938225
Title
Incorporting Keyword-Based Filtering to Document Classification for Email Spamming
Author
Wong, Tak-Lam ; Chow, Kai-On ; Wong, Franz
Author_Institution
City Univ. of Hong Kong, Kowloon
Volume
7
fYear
2007
fDate
19-22 Aug. 2007
Firstpage
3899
Lastpage
3904
Abstract
Email spamming causes serious problems in the Internet resulting in a huge waste of resources and attracting high attention from research society. Automatic document classification and keyword-based filtering are two kinds of techniques which have been applied to filter spam emails to achieve satisfactory results. This paper proposes a formal method by incorporating keyword-based filtering to document classification. To consider the potentially high cost of misclassification of an email to a spam email in real-word situation, a cost-sensitive evaluation metric is adopted to evaluate our approaches. We conducted extensive experiments in real-word data showing promising results.
Keywords
classification; document handling; information filtering; unsolicited e-mail; document classification; email spamming; keyword-based filtering; Bandwidth; Computer science; Costs; Electronic mail; Information filtering; Information filters; Internet; Machine learning; Uncertainty; Watches; Cost-sensitive evaluation; Documents classification; Email filtering; Keyword-based filtering;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location
Hong Kong
Print_ISBN
978-1-4244-0973-0
Electronic_ISBN
978-1-4244-0973-0
Type
conf
DOI
10.1109/ICMLC.2007.4370827
Filename
4370827
Link To Document