Title of article :
Spam Filtering Approach based on Weighted Version of Possibilistic c-Means
Author/Authors :
hameed, sarab m. university of baghdad - college of science - department of computer science, Iraq , mohammed, marwan b. al-nahrain university, Iraq
From page :
1112
To page :
1127
Abstract :
A principal problem of any internet user is the increasing number of spam, which became a great problem today. Therefore, spam filtering has become a research fo-cus that attracts the attention of several security researchers and practitioners. Spam filtering can be viewed as a two-class classification problem. To this end, this paper proposes a spam filtering approach based on Possibilistic c-Means (PCM) algorithm and weighted distance coined as (WFCM) that can efficiently distinguish between spam and legitimate email messages. The objective of the formulated fuzzy problem is to construct two fuzzy clusters: spam and email clusters. The weight assignment is set by information gain algorithm. Experimental results on spam based benchmark dataset reveal that proper setting of feature-weight can improve the performance of the proposed spam filtering approach. Furthermore, the proposed spam filtering ap-proach performance is better than PCM and Naïve Bayes filtering technique.
Keywords :
Possibilistic c , Means (PCM) algorithm , Weighted PCM , Naïve Bayes , Spam filtering
Journal title :
Iraqi Journal Of Science
Journal title :
Iraqi Journal Of Science
Record number :
2639779
Link To Document :
بازگشت