DocumentCode
2896086
Title
Automatic Extraction of Chinese/Japanese Translation Patterns Using Prefix Span
Author
Qian, Wang ; Komiya, Kanako ; Kotani, Yoshiyuki
Author_Institution
Grad. Sch. of Eng., Tokyo Univ. of Agric. & Technol., Tokyo, Japan
fYear
2011
fDate
11-13 Nov. 2011
Firstpage
139
Lastpage
144
Abstract
In late years, a large number of translation patterns are required for the pattern based machine translation. We propose an efficient method to extract the Japanese/Chinese translation patterns from the corpora using Prefix Span. They performed chunking on the sentence pairs of the parallel corpora, collected the candidate translation patterns from them using Prefix Span, and narrow down the candidates using two criteria: the point wise mutual information (PMI) and the degree of confidence for the threshold values. The proposed method achieved precision 85% when the PMI is 1.0 and the degree of confidence is 0.15.
Keywords
language translation; natural language processing; Chinese-Japanese translation pattern automatic extraction; candidate translation patterns; parallel corpora; pattern based machine translation; point wise mutual information; prefix span; sentence pair chunking; threshold values confidence degree; Artificial intelligence; Chinese; Japanese; Prefix Span; translation pattern;
fLanguage
English
Publisher
ieee
Conference_Titel
Technologies and Applications of Artificial Intelligence (TAAI), 2011 International Conference on
Conference_Location
Chung-Li
Print_ISBN
978-1-4577-2174-8
Type
conf
DOI
10.1109/TAAI.2011.31
Filename
6120733
Link To Document