DocumentCode :
629553
Title :
Active learning for Turkish sentiment analysis
Author :
Cetin, Mujdat ; Fatih Amasyali, M.
Author_Institution :
Dept. of Comput. Eng., Yildiz Tech. Univ., Istanbul, Turkey
fYear :
2013
fDate :
19-21 June 2013
Firstpage :
1
Lastpage :
4
Abstract :
Sentiment analysis/classification is a widely studied problem of natural language processing and data mining. With the availability of social media, there are a lot of data but it´s hard to find a labeled training set because of its high cost. The goal of active learning is to get a better or same performance with fewer training data. In this work, the feasibility of active learning scheme for Turkish sentiment analysis is investigated. As a result, the same performance with full training set could be obtained with only half of the training set selected by active learning. Moreover, the affects of different clustering algorithms used at the initial set selection are investigated.
Keywords :
data mining; learning (artificial intelligence); natural language processing; pattern classification; pattern clustering; text analysis; Turkish sentiment analysis; active learning scheme; clustering algorithm; data mining; initial set selection; labeled training set; natural language processing; sentiment classification; social media; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Couplings; Niobium; Training; Training data; Active Learning; Clustering; Clustering Algorithms; Hierarchical Clustering; K-mean; Self Organizing Maps; Sentiment Analysis; Sentiment Classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Innovations in Intelligent Systems and Applications (INISTA), 2013 IEEE International Symposium on
Conference_Location :
Albena
Print_ISBN :
978-1-4799-0659-8
Type :
conf
DOI :
10.1109/INISTA.2013.6577648
Filename :
6577648
Link To Document :
بازگشت