DocumentCode :
3309147
Title :
Semi-supervised learning for word sense disambiguation using parallel corpora
Author :
Mo Yu ; Shu Wang ; Conghui Zhu ; Tiejun Zhao
Author_Institution :
MOE-MS Key Lab. of Natural Language Process. & Speech, Harbin Inst. of Technol., Harbin, China
Volume :
3
fYear :
2011
fDate :
26-28 July 2011
Firstpage :
1490
Lastpage :
1494
Abstract :
The Application of word sense disambiguation (WSD) methods based on supervised machine learning are limited by the difficulties in defining sense tags and acquiring labeled data for training. In this paper, the two problems of WSD are solved in a semi-supervised learning framework with the help of parallel corpora. The sense tags are defined automatically according to the results of word alignment on the parallel corpora. And label propagation, a graph-based semi-supervised algorithm, is employed. The experiments show that our method achieves great improvement on Chinese WSD tasks and the performances get significant growth when the scale of monolingual sentences is increasing.
Keywords :
computational linguistics; learning (artificial intelligence); monolingual sentences; parallel corpora; semisupervised learning; supervised machine learning; word alignment; word sense disambiguation; Accuracy; Computational linguistics; Computers; Conferences; Support vector machines; Training; Training data; label propagation; parallel corpora; semi-supervised learning; word sense disambiguation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
Type :
conf
DOI :
10.1109/FSKD.2011.6019785
Filename :
6019785
Link To Document :
بازگشت