DocumentCode
3490288
Title
An Approach to N-Gram Language Model Evaluation in Phrase-Based Statistical Machine Translation
Author
Jinsong Su ; Qun Liu ; Huailin Dong ; Yidong Chen ; Xiaodong Shi
Author_Institution
Inst. of Comput. Technol., Beijing, China
fYear
2012
fDate
13-15 Nov. 2012
Firstpage
201
Lastpage
204
Abstract
N-gram Language model plays an important role in statistical machine translation. Traditional methods adopt perplexity to evaluate language models, while this metric does not consider the characteristics of statistical machine translation. In this paper, we propose a novel method, namely bag-of-words decoding, to evaluate n-gram language models in phrase-based statistical machine translation. As compared with perplexity, our approach has more remarkable correlation with translation quality measured by BLEU. Experimental results on NIST data sets demonstrate the effectiveness of our method.
Keywords
language translation; natural language processing; statistical analysis; BLEU; N-gram language model evaluation; NIST data sets; bag-of-words decoding; language model evaluation; natural language processing; phrase-based statistical machine translation; translation quality; Computational modeling; Correlation; Correlation coefficient; Data models; Decoding; NIST; Training; Language model evaluation; Statistical Machine Translation;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4673-6113-2
Electronic_ISBN
978-0-7695-4886-9
Type
conf
DOI
10.1109/IALP.2012.70
Filename
6473731
Link To Document