Title of article :
Sentiment classification of Roman-Urdu opinions using Naıve Bayesian, Decision Tree and KNN classification techniques
Author/Authors :
Bilal, Muhammad University of Agriculture - Institute of Biomedical Science(IBMS) - Department of Computer Science and Information Technology, Pakistan , Israr, Huma University of Agriculture - Institute of Biomedical Science(IBMS) - Department of Computer Science and Information Technology, Pakistan , Shahid, Muhammad University of Agriculture - Institute of Biomedical Science(IBMS) - Department of Computer Science and Information Technology, Pakistan , Khan, Amin University of Agriculture - Institute of Biomedical Science(IBMS) - Department of Computer Science and Information Technology, Pakistan
Abstract :
Sentiment mining is a field of text mining to determine the attitude of people about a particular product, topic, politician in newsgroup posts, review sites, comments on facebook posts twitter, etc. There are many issues involved in opinion mining. One important issue is that opinions could be in different languages (English, Urdu, Arabic, etc.). To tackle each language according to its orientation is a challenging task. Most of the research work in sentiment mining has been done in English language. Currently, limited research is being carried out on sentiment classification of other languages like Arabic, Italian, Urdu and Hindi. In this paper, three classification models are used for text classification using Waikato Environment for Knowledge Analysis (WEKA). Opinions written in Roman-Urdu and English are extracted from a blog. These extracted opinions are documented in text files to prepare a training dataset containing 150 positive and 150 negative opinions, as labeled examples. Testing data set is supplied to three different models and the results in each case are analyzed. The results show that Naıve Bayesian outperformed Decision Tree and KNN in terms of more accuracy, precision, recall and F-measure
Keywords :
Roman Urdu , Opinion mining , Bag of words , Naıve Bayes , Decision Tree , k , Nearest Neighbor
Journal title :
Journal Of King Saud University - Computer and Information Sciences
Journal title :
Journal Of King Saud University - Computer and Information Sciences