Title of article :
Building an Arabic Sentiment Lexicon Using Semi-supervised Learning
Author/Authors :
Mahyoub, Fawaz H.H. King Abdulaziz University - Faculty of Computing and Information Technology, Saudi Arabia , Mahyoub, Fawaz H.H. Taiz University - Faculty of Computer Sciences and Information Technology, Yemen , Siddiqui, Muazzam A. King Abdulaziz University - Faculty of Computing and Information Technology, Saudi Arabia , Dahab, Mohamed Y. King Abdulaziz University - Faculty of Computing and Information Technology, Saudi Arabia
Abstract :
Sentiment analysis is the process of determining a predefined sentiment from text written in a natural language with respect to the entity to which it is referring. A number of lexical resources are available to facilitate this task in English. One such resource is the SentiWordNet, which assigns sentiment scores to words found in the English WordNet. In this paper, we present an Arabic sentiment lexicon that assigns sentiment scores to the words found in the Arabic WordNet. Starting from a small seed list of positive and negative words, we used semi-supervised learning to propagate the scores in the Arabic WordNet by exploiting the synset relations. Our algorithm assigned a positive sentiment score to more than 800, a negative score to more than 600 and a neutral score to more than 6000 words in the Arabic WordNet. The lexicon was evaluated by incorporating it into a machine learning-based classifier. The experiments were conducted on several Arabic sentiment corpora, and we were able to achieve a 96% classification accuracy.
Keywords :
Sentiment lexicon , Sentiment analysis , Arabic natural language processing , Text mining , Semi , supervised learning
Journal title :
Journal Of King Saud University - Computer and Information Sciences
Journal title :
Journal Of King Saud University - Computer and Information Sciences