Title :
Competitive Self-Training technique for sentiment analysis in mass social media
Author :
Sola Hong ; Jaedong Lee ; Jee-Hyong Lee
Author_Institution :
Dept. of Electr. & Comput. Eng., Sungkyunkwan Univ., Suwon, South Korea
Abstract :
This paper aims to analyze user\´s emotion automatically by analyzing Twitter using "data without sentiment labels", not only "data with sentiment labels", to increase accuracy of sentiment analysis through an improved Self-Training, one of Semi-supervised learning techniques. Self-Training has a weak point that a classification mistake can reinforce itself. Self-Training iteratively modifies the model based on the output of the model. Thus, if the model generates wrong output, the model can be wrongly modified. For alleviate this weak point, we propose a competitive Self-Training technique. We create three models based on the output of the model and choose the best. Three models are created by binary mixture perspectives: the threshold, the same number, and the maximum number for updates. We repeat step that creating model and choosing a best model highest to get F-measure. Finally, we can improve the performance of sentiment analysis model.
Keywords :
data analysis; learning (artificial intelligence); social networking (online); social sciences computing; F-measure; Twitter; binary mixture perspectives; competitive self-training technique; mass social media; semisupervised learning techniques; sentiment analysis; user emotion analysis; Accuracy; Analytical models; Data models; Sentiment analysis; Support vector machines; Training; Twitter; Self-Training technique; Semi-supervised learning; Sentiment Analysis; Support Vector Machine; Twitter;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
DOI :
10.1109/SCIS-ISIS.2014.7044857