• DocumentCode
    1798673
  • Title

    To filter discontinuous word alignment for statistical machine translationaper

  • Author

    Chenchen Ding ; Yamamoto, Mikio

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Tsukuba, Tsukuba, Japan
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    449
  • Lastpage
    453
  • Abstract
    We propose a language-independent approach to clean up word alignment errors in an aligned parallel corpus, which are caused by the unsupervised word-align process. In such an aligned corpus, we evaluate the alignment patterns of one-to-many discontinuous words by statistical measures of collocation. The alignment of discontinuous words without strong collocation tendencies will be taken as errors and deleted. We conduct experiments on two-directional Japanese-English and German-English translation tasks. The experiment results show the state-of-the-art word alignment filtered by the proposed approach can lead to a better translation performance.
  • Keywords
    information filtering; language translation; natural language processing; pattern recognition; statistical analysis; task analysis; German-English translation tasks; Japanese-English translation tasks; alignment patterns; collocation tendencies; discontinuous word alignment filtering; discontinuous words; language-independent approach; statistical machine translationaper; statistical measures; word alignment errors; Computational linguistics; Dictionaries; Grammar; Pragmatics; Stochastic processes; Syntactics; Training; discontinuous word alignment; statistical machine translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2014.7009834
  • Filename
    7009834