Title :
Performing sentiment analysis in Bangla microblog posts
Author :
Chowdhury, Shuvro ; Chowdhury, Wasifa
Author_Institution :
Dept. of Comput. Sci. & Eng., BRAC Univ., Dhaka, Bangladesh
Abstract :
Much of the research work on sentiment analysis has been carried out in the English language, but work in Bangla is limited to only news corpus and blogs. Microblogging sites are becoming a valuable source for publishing huge volumes of user-generated information, as users express their views, opinions, and sentiments over various topics. In this paper, we aim to automatically extract the sentiments or opinions conveyed by users from Bangla microblog posts and then identify the overall polarity of texts as either negative or positive. We use a semi-supervised bootstrapping approach for the development of the training corpus which avoids the need for labor intensive manual annotation. For classification, we use Support Vector Machine (SVM) and Maximum Entropy (MaxEnt) and do a comparative analysis on the performance of these two machine learning algorithms by experimenting with a combination of various sets of features.
Keywords :
Web sites; learning (artificial intelligence); natural language processing; support vector machines; Bangla microblog posts; English language; MaxEnt; Microblogging sites; SVM; labor intensive manual annotation; machine learning algorithms; maximum entropy; performing sentiment analysis; semisupervised bootstrapping approach; support vector machine; user generated information; Accuracy; Artificial neural networks; Feature extraction; Sentiment analysis; Support vector machines; Training; Twitter; bangla microblog posts; maximum entropy; semi-supervised bootstrapping; sentiment analysis; support vector machine;
Conference_Titel :
Informatics, Electronics & Vision (ICIEV), 2014 International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-4799-5179-6
DOI :
10.1109/ICIEV.2014.6850712