DocumentCode :
2259206
Title :
Contextual binarization for syntax-based machine translation
Author :
Chen, Qing ; Yao, Tianshun
Author_Institution :
Natural Language Process. Lab., Northeastern Univ. Shenyang, Shenyang, China
fYear :
2009
fDate :
24-27 Sept. 2009
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, by relabeling nodes generated during binarization of syntactic trees, contexts can be easily and systematically integrated. This not only helps to restructure syntactic trees to obtain smaller rules, that can be acquired and exploited for translation, also helps to determine which rules are most suitable for translation. By contextual binarization, high-quality translation could be easily generated from the contextual rules, if available; otherwise the translation just falls back on original syntax-based model without performance loss. Experimental results on the NIST Chinese-to-English corpus show promising improvements, the system applying contextual binarization outperforms over both the original syntax-based system and the original one with right binarization.
Keywords :
language translation; natural language processing; NIST Chinese-to-English corpus; contextual binarization; high-quality translation; syntactic trees; syntax-based machine translation; syntax-based model; Context modeling; Data mining; Labeling; Magnetic heads; NIST; Natural language processing; Performance loss; Probability distribution; Surface-mount technology; Contextual Binarization; Simple Binarization; Syntactic Tree; Syntax-based Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2009. NLP-KE 2009. International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-4538-7
Electronic_ISBN :
978-1-4244-4540-0
Type :
conf
DOI :
10.1109/NLPKE.2009.5313745
Filename :
5313745
Link To Document :
بازگشت