• DocumentCode
    2910542
  • Title

    A Rule-Based Source-Side Reordering on Phrase Structure Subtrees

  • Author

    Liang, Fangli ; Chen, Lei ; Li, Miao ; Nasun-Urtu

  • Author_Institution
    Inst. of Intell. Machines, Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2011
  • fDate
    15-17 Nov. 2011
  • Firstpage
    173
  • Lastpage
    176
  • Abstract
    Since different languages put words in different orders, reordering is an important issue in statistical machine translation. The paper proposes a rule-based source-side reordering method, used as a preprocessing step, which applies syntactic reordering rules to phrase structure subtrees to reorder the source language. The reordering rules integrate the phrase structure tree with part-of-speech tags, so reordering can be performed not only between words but also between words and phrases. This partly solves the problems of long-distance reordering and translation errors. Meanwhile, interference between the reordering rules is significantly reduced. Experiments show that the method improves the performance of state-of-the-art phrase translation models, achieving an increase of 1.71 BLEU over the standard phrase-based machine translation system.
  • Keywords
    computational linguistics; language translation; natural language processing; trees (mathematics); 1.71 BLEU score; long-distance reordering; part-of-speech tags; phrase structure subtrees; rule-based source side reordering method; standard phrase-based machine translation system; statistical machine translation; syntactic reordering rules; Context; Decoding; Entropy; Interference; Pragmatics; Syntactics; Training; phrase structure trees; reordering
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Asian Language Processing (IALP), 2011 International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1733-8
  • Type

    conf

  • DOI
    10.1109/IALP.2011.12
  • Filename
    6121496