Title :
An architecture of active learning SVMs for spam
Author :
Kunlun, Li ; Houkuan, Huang
Author_Institution :
Dept. of Comput. Sci. & Technol., Northern Jiaotong Univ., Beijing, China
Abstract :
We propose a new method for spam categorization based on support vector machines (SVMs) using active learning strategy. We study the use of support vector machines in classifying e-mail as spam or nonspam. It analyzes the particular properties of our special task and identifies why SVMs are appropriate for dealing with spam. Instead of using a randomly selected training set, the learner has access to a pool of unlabeled instances and can request the labels for some number of them. We introduce a new method for choosing which instances to request next.
Keywords :
electronic mail; learning (artificial intelligence); learning automata; signal classification; SVM; active learning architecture; e-mail classification; feature representation; junk mail; spam classification; support vector machines; Computer architecture; Computer science; Electronic mail; Machine learning; Postal services; Risk management; Support vector machine classification; Support vector machines; Unsolicited electronic mail; Virtual colonoscopy;
Conference_Titel :
Signal Processing, 2002 6th International Conference on
Print_ISBN :
0-7803-7488-6
DOI :
10.1109/ICOSP.2002.1180017