• DocumentCode
    3490317
  • Title

    Morpheme Segmentation Using Bilingual Features

  • Author

    Hui Liu ; Miao Li ; Jian Zhang ; Lei Chen

  • Author_Institution
    China Inst. of Intell. Machines, Univ. of Sci. & Technol., Hefei, China
  • fYear
    2012
  • fDate
    13-15 Nov. 2012
  • Firstpage
    209
  • Lastpage
    212
  • Abstract
    This paper presents an optimizing morphological segmentation metric for statistical machine translation performance. Unlike previous morpheme segmentation work for getting greater linguistic accuracy we focus on factors such as consistency, coverage and granularity, which directly affect MT performance. We propose a novel combination of dictionary information and statistical model, taking advantage of source-target bilingual features. Our method effectively integrates morpheme information while avoiding the complex calculations generated with the traditional usage of the morphemes. Experiments show that the approach outperforms previously proposed ones and provides an improvement of 1.03 and 0.89 BLEU results in both phrased-based and factored-based MT model on the Chinese-Mongolian translation task.
  • Keywords
    dictionaries; language translation; natural language processing; statistical analysis; BLEU; Chinese-Mongolian translation task; bilingual features; dictionary information; factored-based MT model; morpheme information; morphological segmentation metric; phrased-based MT model; source-target bilingual features; statistical machine translation performance; statistical model; Computational modeling; Context modeling; Dictionaries; Mathematical model; Measurement; Morphology; Pragmatics; CRF model; Machine Translation; Morpheme Segmentation; Statistical Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Asian Language Processing (IALP), 2012 International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4673-6113-2
  • Electronic_ISBN
    978-0-7695-4886-9
  • Type

    conf

  • DOI
    10.1109/IALP.2012.52
  • Filename
    6473733