DocumentCode :
2622191
Title :
Email Categorization Using Multi-stage Classification Technique
Author :
Islam, Md Rafiqul ; Zhou, Wanlei
Author_Institution :
Deakin Univ., Melbourne
fYear :
2007
fDate :
3-6 Dec. 2007
Firstpage :
51
Lastpage :
58
Abstract :
This paper presents an innovative email categorization using a serialized multi-stage classification ensembles technique. Many approaches are used in practice for email categorization to control the menace of spam emails in different ways. Content-based email categorization employs filtering techniques using classification algorithms to learn to predict spam e-mails given a corpus of training e-mails. This process achieves a substantial performance with some amount of FP tradeoffs. It has been studied and investigated with different classification algorithms and found that the outputs of the classifiers vary from one classifier to another with same email corpora. In this paper we have proposed a multi-stage classification technique using different popular learning algorithms with an analyser which reduces the FP (false positive) problems substantially and increases classification accuracy compared to similar existing techniques.
Keywords :
filtering theory; learning (artificial intelligence); pattern classification; unsolicited e-mail; content-based email categorization; false positive problems; filtering techniques; learning algorithms; serialized multistage classification ensembles technique; spam emails; training e-mails; Classification algorithms; Costs; Distributed computing; Electronic mail; Feedback; Filtering; Humans; Information technology; Internet; Machine learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies, 2007. PDCAT '07. Eighth International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7695-3049-4
Type :
conf
DOI :
10.1109/PDCAT.2007.71
Filename :
4420141
Link To Document :
بازگشت