Title :
Enhanced bag-of-words model for phrase-level sentiment analysis
Author :
Kasthuriarachchy, Buddhika H. ; De Zoysa, Kasun ; Premaratne, H.L.
Author_Institution :
Sch. of Comput., Univ. of Colombo, Colombo, Sri Lanka
Abstract :
We propose a novel rule-based model to incorporate contextual information and effect of negation that enhances the performance of sentiment classification performed using bag-of-words models. We employed morphological analysis in feature extraction to ensure feature vector contains only opinionated words in a textual review. Also it reduces the dimensionality of feature vector and, eventually improves the efficiency of the classification algorithm. Further, we consider grammatical relationships to incorporate the context of adjectives and scope of negations within a phrase, to the feature vector. This enables our model to capture contextual polarity of adjectives and impact of negation words. For the morphological analysis we mainly employ Part Of Speech taggers (POS taggers) and grammatical relationships which are obtained using typed dependency parsers. By using dependency-based rules, we relax the conditional independent assumption of bag-of-words models by way of combining adjectives and negations to identified target words and, hence obtain a sentiment classification accuracy that significantly better than baseline performance.
Keywords :
feature extraction; grammars; natural language processing; pattern classification; vectors; POS taggers; adjectives; bag-of-words model; classification algorithm; conditional independent assumption; contextual information; contextual polarity; dependency-based rules; dimensionality reduction; feature extraction; feature vector; grammatical relationships; morphological analysis; negation words; part of speech taggers; performance enhancement; phrase-level sentiment analysis; rule-based model; sentiment classification accuracy; textual review; typed dependency parsers; Accuracy; Analytical models; Context; Context modeling; Feature extraction; Sentiment analysis; Support vector machine classification; bag-of-words; contextual polarity; sentiment analysis;
Conference_Titel :
Advances in ICT for Emerging Regions (ICTer), 2014 International Conference on
Print_ISBN :
978-1-4799-7731-4
DOI :
10.1109/ICTER.2014.7083903