DocumentCode :
2348766
Title :
Improving phrase-based SMT model with Flattened Bilingual Parse Tree
Author :
Zhang, Dakun ; Sun, Le ; Li, Wenbo
Author_Institution :
Inst. of Software, Chinese Acad. of Sci., Beijing, China
fYear :
2010
fDate :
21-23 Aug. 2010
Firstpage :
1
Lastpage :
8
Abstract :
Phrase order has a strong influence on translation quality. However, general phrase-based methods use only source-side information for phrase ordering. We instead propose a bilingual parse structure, the Flattened Bilingual Parse Tree (FBPT), to better describe the inner structure of bilingual sentences and thereby produce better translations. The main idea is to extract phrase pairs with orientation features with the help of the FBPT structure. Such features help maintain better sentence generation during translation. Furthermore, the FBPT structure can be learned automatically from a parallel corpus at lower cost, without the need for complex linguistic parsing. Evaluations on the MT08 translation task indicate that a 7% relative improvement in BLEU can be achieved compared to a distortion-based method (such as Pharaoh).
Keywords :
grammars; language translation; statistical analysis; trees (mathematics); MT08 translation task; flattened bilingual parse tree; linguistic parsing; parallel corpus; phrase orderings; phrase-based SMT model; source side information; statistical machine translation; translation quality; bilingual parse structure; formally syntax-based method
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-6896-6
Type :
conf
DOI :
10.1109/NLPKE.2010.5587836
Filename :
5587836