• DocumentCode
    2910603
  • Title

    Using Rich Linguistic and Contextual Information for Tree-Based Statistical Machine Translation

  • Author

    Hung, Bui Thanh ; Nguyen Le Minh ; Shimazu, Akira

  • Author_Institution
    Grad. Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
  • fYear
    2011
  • fDate
    15-17 Nov. 2011
  • Firstpage
    189
  • Lastpage
    192
  • Abstract
    This paper presents an approach to select appropriate translation rules to improve phrase-reordering of tree-based statistical machine translation. We propose new features with rich linguistic and contextual information. We give a new algorithm to extract features, use maximum entropy to combine rich linguistic and contextual information and integrate these features into the tree-based SMT model (Moses-chart). We obtain substantial improvements in performance for tree-based translation from Vietnamese to English.
  • Keywords
    language translation; maximum entropy methods; Moses-chart model; Vietnamese-English translation; contextual information; feature extraction; maximum entropy; phrase-reordering; rich linguistic information; translation rule; tree-based statistical machine translation; Computational linguistics; Decoding; Entropy; Feature extraction; Pragmatics; Syntactics; Training; Linguistic and Contextual Information; Maximum Entropy Model; Phrase Reordering; Tree-based SMT;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2011 International Conference on
  • Conference_Location
    Penang
  • Print_ISBN
    978-1-4577-1733-8
  • Type

    conf

  • DOI
    10.1109/IALP.2011.60
  • Filename
    6121500