Title :
Cross-Language Opinion Target Extraction in Review Texts
Author :
Xinjie Zhou ; Xiaojun Wan ; Jianguo Xiao
Author_Institution :
MOE Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
Abstract :
Opinion target extraction is a subtask of opinion mining which is very useful in many applications. In this study, we investigate the problem in a cross-language scenario which leverages the rich labeled data in a source language for opinion target extraction in a different target language. The English labeled corpus is used as training set. We generate two Chinese training datasets with different features. Two labeling models for Chinese opinion target extraction are learned based on Conditional Random Fields (CRF). After that, we use a monolingual co-training algorithm to improve the performance of both models by leveraging the enormous unlabeled Chinese review texts on the web. Experimental results show the effectiveness of our proposed approach.
Keywords :
data mining; natural language processing; text analysis; training; CRF-based learning; Chinese opinion target extraction; Chinese review texts; English labeled corpus; conditional random fields-based learning; cross-language opinion target extraction; monolingual cotraining algorithm; opinion mining; rich labeled data; source language; target language; training set; Algorithm design and analysis; Corporate acquisitions; Data mining; Feature extraction; Information retrieval; Labeling; Training; cross-language information extraction; opinion mining; opinion target extraction;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.32