Title :
Reordering experiments for n-gram-based SMT
Author :
Crego, J.M. ; Marino, J.B.
Author_Institution :
TALP Res. Center, Univ. Politec. de Catalunya, Barcelona
Abstract :
This paper addresses the problem of reordering in statistical machine translation (SMT). We describe an elegant and efficient approach to couple reordering (word order monotonization) and decoding, which does not need for any additional model. We use linguistically motivated reordering rules to extend a monotonic search graph (with reordering hypotheses). The extended graph is traversed in decoding when a fully- informed decision can be taken (no preprocessing decision about reordering is taken). We also show how the N-gram translation model can be successfully used as reordering model when estimated with reordered source words (to harmonize the source and target word order). Experiments are reported on the Euparl task (Spanish- to-English and English-to-Spanish). Results are presented regarding translation accuracy and computational efficiency, showing significant improvements in translation quality for both translation directions at a very low computational cost.
Keywords :
computational linguistics; decoding; graph theory; language translation; word processing; decoding; monotonic search graph; n-gram-based SMT; reordering rules; statistical machine translation; translation quality; word order monotonization; Computational efficiency; Decoding; Entropy; Equations; Frequency; Surface-mount technology; System testing;
Conference_Titel :
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location :
Palm Beach
Print_ISBN :
1-4244-0872-5
DOI :
10.1109/SLT.2006.326800