• DocumentCode
    3490288
  • Title

    An Approach to N-Gram Language Model Evaluation in Phrase-Based Statistical Machine Translation

  • Author

    Jinsong Su ; Qun Liu ; Huailin Dong ; Yidong Chen ; Xiaodong Shi

  • Author_Institution
    Inst. of Comput. Technol., Beijing, China
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    201
  • Lastpage
    204
  • Abstract
    N-gram Language model plays an important role in statistical machine translation. Traditional methods adopt perplexity to evaluate language models, while this metric does not consider the characteristics of statistical machine translation. In this paper, we propose a novel method, namely bag-of-words decoding, to evaluate n-gram language models in phrase-based statistical machine translation. As compared with perplexity, our approach has more remarkable correlation with translation quality measured by BLEU. Experimental results on NIST data sets demonstrate the effectiveness of our method.
  • Keywords
    language translation; natural language processing; statistical analysis; BLEU; N-gram language model evaluation; NIST data sets; bag-of-words decoding; language model evaluation; natural language processing; phrase-based statistical machine translation; translation quality; Computational modeling; Correlation; Correlation coefficient; Data models; Decoding; NIST; Training; Language model evaluation; Statistical Machine Translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.70
  • Filename
    6473731