Title :
Using Multi-phase Cost-Sensitive Learning to Filtering Spam
Author :
Li, Wenbin ; Cheng, Yiying ; Liu, TaiFeng ; Zhang, Xindong ; Zhong, Ning
Author_Institution :
Sch. of Inf. Eng., Shijiazhuang Univ. of Econ., Shijiazhuang, China
Abstract :
This paper proposes a novel ensemble learning framework, namely, multiple-phase cost-sensitive ensemble learning (MPCSL) which simulates the means and process of human being learning.It consists of two types learning, i.e., direct learning which learns multiple weak learners from a training dataset via some homogeneous or heterogenous algorithms, and indirect learning that constructs a committee from the knowledge of the combined filters or other committees. This paper studies empirically the performance of MPCSL on spam filtering tasks.In the occasions of combining homogeneous and heterogeneous,how the performance of MPCSL changes is surveyed. The results shows that MPCSL is a compellent ensemble learning method for cost-sensitive tasks such as spam filtering.
Keywords :
information filtering; learning (artificial intelligence); unsolicited e-mail; ensemble learning framework; multiphase cost-sensitive learning; spam filtering; training dataset; Bagging; Boosting; Hybrid intelligent systems; Information filtering; Information filters; Learning systems; Paper technology; Text categorization; Unsolicited electronic mail; Voting; cost sensitive learning; machine learning; spam filtering; text processing;
Conference_Titel :
Hybrid Intelligent Systems, 2009. HIS '09. Ninth International Conference on
Conference_Location :
Shenyang
Print_ISBN :
978-0-7695-3745-0
DOI :
10.1109/HIS.2009.120