Title :
SMS spam detection using selected text features and Boosting Classifiers
Author :
Fatemeh Akbari;Hedieh Sajedi
Author_Institution :
Dept. of Electrical, Computer and Information Technology, Qazvin Islamic Azad University, Iran
fDate :
5/1/2015 12:00:00 AM
Abstract :
Short Message Service (also called SMS), is the product of modern mobile communication, which provides more convenience and options for communication. The unwanted advertisements sent with SMS significantly interferes the normal communication among costumers, which should have been guaranteed by the mobile service providers. In this paper, we propose a GentleBoost algorithm for SMS spam detection. Because we have unbalanced data and for this type of data, Boosting Classifiers are good choice. Also, we found that in most cases, especially when we have binary Classification and unbalanced data, GentleBoost performs better than other Boosting Classifiers and leads to better performance. Therefore, we choose GentleBoost as preferred Classifier, also we guarantee that no one use this technique for SMS spam detection and this method is a completely novel method. Another important advantage of the suggested approach is related to the way of extracting the word attributes because by removing unused features and optimizing them, we reduced the number of word attributes significantly without reducing accuracy. Therefore, Our propose approach obtained high accuracy with minimum storage consumption.
Keywords :
"Feature extraction","Classification algorithms","Accuracy","Filtering","Unsolicited electronic mail","Boosting","Mobile communication"
Conference_Titel :
Information and Knowledge Technology (IKT), 2015 7th Conference on
Print_ISBN :
978-1-4673-7483-5
DOI :
10.1109/IKT.2015.7288782