Title :
Factored translation model in English-to-Thai translation
Author :
Porkaew, Peerachet ; Takhom, Akkharawoot ; Supnithi, Thepchai
Author_Institution :
Human Language Technol. Lab., Nat. Electron. & Comput. Technol., Pathumthani, Thailand
Abstract :
The objective of this paper is to find an appropriate combination of English factors to support English-to-Thai translation task. There are four English factors those are surface, lemma, part-of-speech (POS) and combinatory categorical grammar (CCG). Various combinations of those factors were trained to build the most efficient model for translation. We test the accuracy using the BLEU score. The result shows that the factored model which consists of lemma and CCG produce the highest accuracy at 24.21% (1.05% superior than baseline). In future work, we will extend this work to Thai-to-English translation.
Keywords :
grammars; language translation; natural language processing; BLEU score; English factors; English-to-Thai translation task; Thai-to-English translation; combinatory categorical grammar; factored translation model; lemma; part-of-speech; Bayesian methods; Humans; Information retrieval; Information technology; Internet; Natural language processing; Natural languages; Probability; Surface-mount technology; Testing;
Conference_Titel :
Natural Language Processing, 2009. SNLP '09. Eighth International Symposium on
Conference_Location :
Bangkok
Print_ISBN :
978-1-4244-4138-9
Electronic_ISBN :
978-1-4244-4139-6
DOI :
10.1109/SNLP.2009.5340929