DocumentCode :
2982182
Title :
Cross-Language Opinion Target Extraction in Review Texts
Author :
Xinjie Zhou ; Xiaojun Wan ; Jianguo Xiao
Author_Institution :
MOE Key Lab. of Comput. Linguistics, Peking Univ., Beijing, China
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
1200
Lastpage :
1205
Abstract :
Opinion target extraction is a subtask of opinion mining which is very useful in many applications. In this study, we investigate the problem in a cross-language scenario which leverages the rich labeled data in a source language for opinion target extraction in a different target language. The English labeled corpus is used as training set. We generate two Chinese training datasets with different features. Two labeling models for Chinese opinion target extraction are learned based on Conditional Random Fields (CRF). After that, we use a monolingual co-training algorithm to improve the performance of both models by leveraging the enormous unlabeled Chinese review texts on the web. Experimental results show the effectiveness of our proposed approach.
Keywords :
data mining; natural language processing; text analysis; training; CRF-based learning; Chinese opinion target extraction; Chinese review texts; English labeled corpus; conditional random fields-based learning; cross-language opinion target extraction; monolingual cotraining algorithm; opinion mining; rich labeled data; source language; target language; training set; Algorithm design and analysis; Corporate acquisitions; Data mining; Feature extraction; Information retrieval; Labeling; Training; cross-language information extraction; opinion mining; opinion target extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.32
Filename :
6413729
Link To Document :
بازگشت