DocumentCode :
1938225
Title :
Incorporating Keyword-Based Filtering to Document Classification for Email Spamming
Author :
Wong, Tak-Lam ; Chow, Kai-On ; Wong, Franz
Author_Institution :
City Univ. of Hong Kong, Kowloon
Volume :
7
fYear :
2007
fDate :
19-22 Aug. 2007
Firstpage :
3899
Lastpage :
3904
Abstract :
Email spamming causes serious problems on the Internet, wasting substantial resources and attracting considerable attention from the research community. Automatic document classification and keyword-based filtering are two techniques that have been applied to filter spam emails with satisfactory results. This paper proposes a formal method that incorporates keyword-based filtering into document classification. To account for the potentially high cost of misclassifying a legitimate email as spam in real-world situations, a cost-sensitive evaluation metric is adopted to evaluate our approaches. We conducted extensive experiments on real-world data, which showed promising results.
Keywords :
classification; document handling; information filtering; unsolicited e-mail; document classification; email spamming; keyword-based filtering; Bandwidth; Computer science; Costs; Electronic mail; Information filtering; Information filters; Internet; Machine learning; Uncertainty; Watches; Cost-sensitive evaluation; Documents classification; Email filtering; Keyword-based filtering
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Machine Learning and Cybernetics, 2007 International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-0973-0
Electronic_ISBN :
978-1-4244-0973-0
Type :
conf
DOI :
10.1109/ICMLC.2007.4370827
Filename :
4370827