DocumentCode
1798673
Title
To filter discontinuous word alignment for statistical machine translationaper
Author
Chenchen Ding ; Yamamoto, Mikio
Author_Institution
Dept. of Comput. Sci., Univ. of Tsukuba, Tsukuba, Japan
fYear
2014
fDate
7-9 July 2014
Firstpage
449
Lastpage
453
Abstract
We propose a language-independent approach to clean up word alignment errors in an aligned parallel corpus, which are caused by the unsupervised word-align process. In such an aligned corpus, we evaluate the alignment patterns of one-to-many discontinuous words by statistical measures of collocation. The alignment of discontinuous words without strong collocation tendencies will be taken as errors and deleted. We conduct experiments on two-directional Japanese-English and German-English translation tasks. The experiment results show the state-of-the-art word alignment filtered by the proposed approach can lead to a better translation performance.
Keywords
information filtering; language translation; natural language processing; pattern recognition; statistical analysis; task analysis; German-English translation tasks; Japanese-English translation tasks; alignment patterns; collocation tendencies; discontinuous word alignment filtering; discontinuous words; language-independent approach; statistical machine translationaper; statistical measures; word alignment errors; Computational linguistics; Dictionaries; Grammar; Pragmatics; Stochastic processes; Syntactics; Training; discontinuous word alignment; statistical machine translation;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio, Language and Image Processing (ICALIP), 2014 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4799-3902-2
Type
conf
DOI
10.1109/ICALIP.2014.7009834
Filename
7009834
Link To Document