مرکز منطقه ای اطلاع رساني علوم و فناوري - Active learning based spam filtering method

DocumentCode :

1673800

Title :

Active learning based spam filtering method

Author :

Zhang, Wei ; Gao, Feng ; Di Lv ; Xue, Feng

Author_Institution :

MOE KLINNS Lab., Xi´´an Jiaotong Univ., Xi´´an, China

fYear :

2010

Firstpage :

3302

Lastpage :

3306

Abstract :

Internet security is seriously threatened by spam spreading, and content-based spam filtering has become one of effective spam-filtering methods. Aiming at the practical problems, we propose an active learning based method which takes naive Bayesian means as basic classifiers. This method randomly initialize a small training set to generate basic classifiers, and then use them to classify mails, which add the most uncertain mail to training set each time to improve the classifier performance. The simulations based on the CCERT mail set show that this method not only reduces the number of mails to be labeled, but also improves classifier accuracy.

Keywords :

e-mail filters; information filtering; learning (artificial intelligence); security of data; unsolicited e-mail; Internet security; active learning based spam filtering; classifier performance; content-based spam filtering; mail classification; naive Bayesian means; spam spreading; Classification algorithms; Filtering; Learning; Machine learning; Postal services; Text categorization; Training; Active learning; Spam filtering; Text categorization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Control and Automation (WCICA), 2010 8th World Congress on

Conference_Location :

Jinan

Print_ISBN :

978-1-4244-6712-9

Type :

conf

DOI :

10.1109/WCICA.2010.5553918

Filename :

5553918

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1673800