Title :
To filter discontinuous word alignment for statistical machine translationaper
Author :
Chenchen Ding ; Yamamoto, Mikio
Author_Institution :
Dept. of Comput. Sci., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
We propose a language-independent approach to clean up word alignment errors in an aligned parallel corpus, which are caused by the unsupervised word-align process. In such an aligned corpus, we evaluate the alignment patterns of one-to-many discontinuous words by statistical measures of collocation. The alignment of discontinuous words without strong collocation tendencies will be taken as errors and deleted. We conduct experiments on two-directional Japanese-English and German-English translation tasks. The experiment results show the state-of-the-art word alignment filtered by the proposed approach can lead to a better translation performance.
Keywords :
information filtering; language translation; natural language processing; pattern recognition; statistical analysis; task analysis; German-English translation tasks; Japanese-English translation tasks; alignment patterns; collocation tendencies; discontinuous word alignment filtering; discontinuous words; language-independent approach; statistical machine translationaper; statistical measures; word alignment errors; Computational linguistics; Dictionaries; Grammar; Pragmatics; Stochastic processes; Syntactics; Training; discontinuous word alignment; statistical machine translation;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2014 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4799-3902-2
DOI :
10.1109/ICALIP.2014.7009834