Title :
Classification of english phrases and SMS text messages using Bayes and Support Vector Machine classifiers
Author :
Maier, J. ; Ferens, K.
Author_Institution :
Dep. of Electr. Eng. & Inf. Technol., Univ. of Appl. Sci. Ravensburg-Weingarten, Ravensburg
Abstract :
This paper performs a comparative analysis of several different types of SMS text classifiers: weight enhanced Multinomial naive Bayes, Poisson naive Bayes, and L2-loss Support Vector Machine. The effects of preprocessing and incorporating additional features on the classifiers were examined. The preliminary experimental results show that the use of preprocessing and incorporating additional features produced no significant gain or loss in classification efficiency. However the feature space used by the classification methods decreased, which could be beneficial for resource limited environments. In addition the solutions to the SMS text classification may be applied to other problems, like the classification of English sentences. Our collection of text messages may not be statistically significant, because of very limited sources for text messages.
Keywords :
Bayes methods; electronic messaging; natural language processing; pattern classification; support vector machines; text analysis; Bayes classifier; English phrases classification; Poisson naive Bayes; SMS text classifier; SMS text messages; classification method; support vector machine classifier; weight enhanced multinomial naive Bayes; Communication channels; Drives; Filtering; Information technology; Machine learning algorithms; Support vector machine classification; Support vector machines; Technological innovation; Telecommunication computing; Text categorization; Multinomial Naive Bayes; Poisson Naive Bayes; Relevance Filtering; SMS preprocessing; Support Vector Machine; Text filtering;
Conference_Titel :
Electrical and Computer Engineering, 2009. CCECE '09. Canadian Conference on
Conference_Location :
St. John´s, NL
Print_ISBN :
978-1-4244-3509-8
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2009.5090166