DocumentCode :
3496410
Title :
Translating unknown words using WordNet and IPA-based-transliteration
Author :
Salam, Khan Md Anwarus ; Setsuo, Yamada ; Nishino, Tetsuro
Author_Institution :
Dept. of Inf. & Commun. Eng., Univ. of Electro-Commun., Chofu, Japan
fYear :
2011
fDate :
22-24 Dec. 2011
Firstpage :
481
Lastpage :
486
Abstract :
Because the available English-Bangla parallel corpus is small, an Example-Based Machine Translation (EBMT) system has a high probability of having to handle unknown words. To improve translation quality for the Bangla language, we propose a novel approach for EBMT using WordNet and International Phonetic Alphabet (IPA)-based transliteration. The proposed system first tries to find semantically related English words in WordNet for the unknown word. From these related words, we choose the semantically closest word whose Bangla translation exists in the English-Bangla dictionary. If no Bangla translation exists, the system uses IPA-based transliteration. For proper nouns, the system uses the Akkhor transliteration mechanism. We implemented the proposed approach in EBMT, which improved the quality of good translations by 16 points.
Keywords :
dictionaries; language translation; natural language processing; Akkhor transliteration mechanism; Bangla language; Bangla translation; EBMT system; English-Bangla dictionary; English-Bangla parallel corpus; IPA-based-transliteration; WordNet; example-based machine translation system; international-phonetic-alphabet-based transliteration; translation quality improvement; unknown word translation; Dictionaries; Gold; Indexes; Marine vehicles; Software; Example-Based Machine Translation; Machine Translation; Transliteration; WordNet;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Technology (ICCIT), 2011 14th International Conference on
Conference_Location :
Dhaka
Print_ISBN :
978-1-61284-907-2
Type :
conf
DOI :
10.1109/ICCITechn.2011.6164838
Filename :
6164838