• DocumentCode
    2348766
  • Title

    Improving phrase-based SMT model with Flattened Bilingual Parse Tree

  • Author

    Zhang, Dakun ; Sun, Le ; Li, Wenbo

  • Author_Institution
    Inst. of Software, Chinese Acad. of Sci., Beijing, China
  • fYear
    2010
  • fDate
    21-23 Aug. 2010
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Phrase orders influence much on translation quality. However, general phrase based methods take only the source side information for phrase orderings. We instead propose a bilingual parse structure, Flattened Bilingual Parse Tree (FBPT), for better describing the inner structure of bilingual sentences and then for better translations. The main idea is to extract phrase pairs with orientation features under the help of FBPT structure. Such features can help maintain better sentence generations during translation. Furthermore, the FBPT structure can be learned automatically from parallel corpus with lower costs without the need of complex linguistic parsing. Evaluations on MT08 translation task indicate that 7% relative improvement on BLEU can be achieved compared to distortion based method (like Pharaoh).
  • Keywords
    grammars; language translation; statistical analysis; trees (mathematics); MT08 translation task; flattened bilingual parse tree; linguistic parsing; parallel corpus; phrase orderings; phrase-based SMT model; source side information; statistical machine translation; translation quality; Statistical machine translation; bilingual parse structure; formally syntax-based method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering (NLP-KE), 2010 International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-6896-6
  • Type

    conf

  • DOI
    10.1109/NLPKE.2010.5587836
  • Filename
    5587836