Title :
Morpheme Segmentation Using Bilingual Features
Author :
Hui Liu ; Miao Li ; Jian Zhang ; Lei Chen
Author_Institution :
China Inst. of Intell. Machines, Univ. of Sci. & Technol., Hefei, China
Abstract :
This paper presents an optimizing morphological segmentation metric for statistical machine translation performance. Unlike previous morpheme segmentation work for getting greater linguistic accuracy we focus on factors such as consistency, coverage and granularity, which directly affect MT performance. We propose a novel combination of dictionary information and statistical model, taking advantage of source-target bilingual features. Our method effectively integrates morpheme information while avoiding the complex calculations generated with the traditional usage of the morphemes. Experiments show that the approach outperforms previously proposed ones and provides an improvement of 1.03 and 0.89 BLEU results in both phrased-based and factored-based MT model on the Chinese-Mongolian translation task.
Keywords :
dictionaries; language translation; natural language processing; statistical analysis; BLEU; Chinese-Mongolian translation task; bilingual features; dictionary information; factored-based MT model; morpheme information; morphological segmentation metric; phrased-based MT model; source-target bilingual features; statistical machine translation performance; statistical model; Computational modeling; Context modeling; Dictionaries; Mathematical model; Measurement; Morphology; Pragmatics; CRF model; Machine Translation; Morpheme Segmentation; Statistical Model;
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
DOI :
10.1109/IALP.2012.52