DocumentCode :
3545588
Title :
An Approach to Word Sense Disambiguation in English-Vietnamese-English Statistical Machine Translation
Author :
Nguyen, Quy ; Nguyen, An ; Dinh, Dien
Author_Institution :
Fac. of Comput. Sci., Univ. of Inf. Technol., Ho Chi Minh City, Vietnam
fYear :
2012
fDate :
Feb. 27 2012-March 1 2012
Firstpage :
1
Lastpage :
4
Abstract :
The most difficult problem of machine translation (MT) in general and statistical machine translation (SMT) in particular is to select the correct meaning of the polysemous words. Their correct meaning mainly depends on the context and the topic of the text. Therefore, to improve the quality of SMT by resolving semantic ambiguity of words, we integrate more knowledge about the topic of the text, part-of-speech (POS) and morphology. We applied this model to English-Vietnamese- English SMT system and BLEU scores increased over 6% compared with the baseline general SMT system, which was not integrated information about the topic or other language knowledge.
Keywords :
language translation; natural language processing; text analysis; BLEU scores; English-Vietnamese-English statistical machine translation; morphology; part-of-speech; polysemous words; text context; text topic; word sense disambiguation; Buildings; Semantics; Support vector machines; Tagging; Testing; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2012 IEEE RIVF International Conference on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-1-4673-0307-1
Type :
conf
DOI :
10.1109/rivf.2012.6169839
Filename :
6169839
Link To Document :
بازگشت