Title :
An Approach Based on Tree Kernels for Opinion Mining of Online Product Reviews
Author :
Jiang, Peng ; Zhang, Chunxia ; Fu, Hongping ; Niu, Zhendong ; Yang, Qing
Author_Institution :
Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
Abstract :
Opinion mining is a challenging task to identify the opinions or sentiments underlying user generated contents, such as online product reviews, blogs, discussion forums, etc. Previous studies that adopt machine learning algorithms mainly focus on designing effective features for this complex task. This paper presents our approach based on tree kernels for opinion mining of online product reviews. Tree kernels alleviate the complexity of feature selection and generate effective features to satisfy the special requirements in opinion mining. In this paper, we define several tree kernels for sentiment expression extraction and sentiment classification, which are subtasks of opinion mining. Our proposed tree kernels encode not only syntactic structure information, but also sentiment related information, such as sentiment boundary and sentiment polarity, which are important features to opinion mining. Experimental results on a benchmark data set indicate that tree kernels can significantly improve the performance of both sentiment expression extraction and sentiment classification. Besides, a linear combination of our proposed tree kernels and traditional feature vector kernel achieves the best performances using the benchmark data set.
Keywords :
behavioural sciences computing; classification; data mining; feature extraction; natural language processing; text analysis; trees (mathematics); blogs; feature selection; feature vector kernel; machine learning; online product reviews; opinion mining; sentiment classification; sentiment expression extraction; syntactic structure information; tree kernels; opinion mining; sentiment analysis; text mining; tree kernels;
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
DOI :
10.1109/ICDM.2010.104