DocumentCode
3490317
Title
Morpheme Segmentation Using Bilingual Features
Author
Hui Liu ; Miao Li ; Jian Zhang ; Lei Chen
Author_Institution
China Inst. of Intell. Machines, Univ. of Sci. & Technol., Hefei, China
fYear
2012
fDate
13-15 Nov. 2012
Firstpage
209
Lastpage
212
Abstract
This paper presents an optimizing morphological segmentation metric for statistical machine translation performance. Unlike previous morpheme segmentation work for getting greater linguistic accuracy we focus on factors such as consistency, coverage and granularity, which directly affect MT performance. We propose a novel combination of dictionary information and statistical model, taking advantage of source-target bilingual features. Our method effectively integrates morpheme information while avoiding the complex calculations generated with the traditional usage of the morphemes. Experiments show that the approach outperforms previously proposed ones and provides an improvement of 1.03 and 0.89 BLEU results in both phrased-based and factored-based MT model on the Chinese-Mongolian translation task.
Keywords
dictionaries; language translation; natural language processing; statistical analysis; BLEU; Chinese-Mongolian translation task; bilingual features; dictionary information; factored-based MT model; morpheme information; morphological segmentation metric; phrased-based MT model; source-target bilingual features; statistical machine translation performance; statistical model; Computational modeling; Context modeling; Dictionaries; Mathematical model; Measurement; Morphology; Pragmatics; CRF model; Machine Translation; Morpheme Segmentation; Statistical Model;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4673-6113-2
Electronic_ISBN
978-0-7695-4886-9
Type
conf
DOI
10.1109/IALP.2012.52
Filename
6473733
Link To Document