Title :
Possibility Theory-Based Approach to Spam Email Detection
Author :
Tran, Dat ; Ma, Wanli ; Sharma, Dharmendra ; Nguyen, Thien
Author_Institution :
Univ. of Canberra, Canberra
Abstract :
Most of current spam email detection systems use keywords in a blacklist to detect spam emails. However these keywords can be written as misspellings, for example "baank", "ba-nk" and "bankk" instead of "bank". Moreover, misspellings are changed from time to time and hence spam email detection system needs to constantly update the blacklist to detect spam emails containing such misspellings. However it is impossible to predict all possible misspellings for a given keyword to add those to the blacklist. We present a possibility theory-based approach to spam email detection to solve this problem. We consider every keyword in the blacklist along with its misspellings as a fuzzy set and propose a possibility function. This function will be used to calculate a possibility score for an unknown email. Using a proposed if-then rule and this core, we can decide whether or not this unknown email is spam. Experimental results are also presented.
Keywords :
fuzzy set theory; possibility theory; unsolicited e-mail; fuzzy set; if-then rule; possibility function; possibility score; possibility theory-based approach; spam email detection; Banking; Bayesian methods; Control systems; Educational institutions; Filters; Fuzzy sets; Postal services; Statistical analysis; USA Councils; Unsolicited electronic mail;
Conference_Titel :
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3032-1
DOI :
10.1109/GrC.2007.123