Title :
Experiments on Indonesian-Japanese statistical machine translation
Author :
Simon, H.S. ; Purwarianti, Ayu
Author_Institution :
Bandung Inst. of Technol. (ITB), Bandung, Indonesia
Abstract :
Based on the characteristics of Indonesian and Japanese language, we did several experiments on the additional process to an Indonesian-Japanese statistical machine translation (SMT). We proposed several additional processes such as employing the POS tag information, adding the size of monolingual target corpus, using Indonesian stemmer in Indonesian to Japanese translation, eliminating Japanese particle in Japanese to Indonesian translation, and the elimination of NE tag. The experimental result showed that compared to the baseline of adding no process to the default SMT engine (here, we use Moses), the highest BLEU score was achieved by the elimination of Japanese particle in Japanese to Indonesian translation.
Keywords :
language translation; natural language processing; statistical analysis; Indonesian stemmer; Indonesian-Japanese statistical machine translation; Japanese particle elimination; NE tag elimination; POS tag information; SMT; Data models; Engines; Manuals; Mathematical model; Probabilistic logic; Training; Training data; Indonesian-Japanese statistical machine translation; NE tag; POS tag; language model; particle; stemmer;
Conference_Titel :
Computational Intelligence and Cybernetics (CYBERNETICSCOM), 2013 IEEE International Conference on
Conference_Location :
Yogyakarta
DOI :
10.1109/CyberneticsCom.2013.6865786