Title :
Semi-supervised learning with lexical knowledge for opinion mining
Author :
Nóra, Balla-Müller ; Lemnaru, Camelia ; Potolea, Rodica
Author_Institution :
Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
Abstract :
Sentiment prediction for text has been an intriguing subject for the last few years. The goal of it is to automatically indicate the positive or negative attitude towards a topic of interest. The proliferation of user generated content on the World Wide Web has made it possible to perform large scale mining of public opinion. This paper presents an original implementation of a system that integrates a recently proposed semi-supervised learning algorithm for text polarity classification. Lexical prior knowledge is harnessed in conjunction with labeled and unlabeled documents. The presented method is based on joint sentiment analysis of documents and words and uses a bipartite graph representation of the data. Our system is integrated into Rapid Miner, which does not come yet with semi-supervised learners.
Keywords :
Internet; data mining; graph theory; learning (artificial intelligence); text analysis; World Wide Web; bipartite graph representation; documents sentiment analysis; lexical knowledge; opinion mining; rapid miner; semi supervised learning; text polarity classification; text sentiment prediction; Accuracy; Classification algorithms; Data mining; Dictionaries; Matrix converters; Motion pictures; Sparse matrices; conjugate gradient; domain adaptation; graph regularization; lexicon; semi-supervised learning; sentiment analysis;
Conference_Titel :
Intelligent Computer Communication and Processing (ICCP), 2010 IEEE International Conference on
Conference_Location :
Cluj-Napoca
Print_ISBN :
978-1-4244-8228-3
Electronic_ISBN :
978-1-4244-8230-6
DOI :
10.1109/ICCP.2010.5606469