• DocumentCode
    2896086
  • Title

    Automatic Extraction of Chinese/Japanese Translation Patterns Using Prefix Span

  • Author

    Qian, Wang ; Komiya, Kanako ; Kotani, Yoshiyuki

  • Author_Institution
    Grad. Sch. of Eng., Tokyo Univ. of Agric. & Technol., Tokyo, Japan
  • fYear
    2011
  • fDate
    11-13 Nov. 2011
  • Firstpage
    139
  • Lastpage
    144
  • Abstract
    In recent years, pattern-based machine translation has required a large number of translation patterns. We propose an efficient method to extract Japanese/Chinese translation patterns from corpora using Prefix Span. We performed chunking on the sentence pairs of the parallel corpora, collected candidate translation patterns from them using Prefix Span, and narrowed down the candidates by applying threshold values to two criteria: the pointwise mutual information (PMI) and the degree of confidence. The proposed method achieved a precision of 85% when the PMI threshold is 1.0 and the degree-of-confidence threshold is 0.15.
  • Keywords
    language translation; natural language processing; Chinese-Japanese translation pattern automatic extraction; candidate translation patterns; parallel corpora; pattern based machine translation; point wise mutual information; prefix span; sentence pair chunking; threshold values confidence degree; Artificial intelligence; Chinese; Japanese; Prefix Span; translation pattern;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2011 International Conference on
  • Conference_Location
    Chung-Li
  • Print_ISBN
    978-1-4577-2174-8
  • Type

    conf

  • DOI
    10.1109/TAAI.2011.31
  • Filename
    6120733