Title :
Lexical Word Similarity for Re-ranking in Vietnamese-English Named Entity Back Transliteration
Author :
Le Thi Hoang Diem ; Aw Ai Ti
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
Transliteration is the transformation of a word from its original language into another language based on its pronunciation. Back transliteration is the transformation of an already transliterated word back to its original form. This backward process is inherently more challenging than the forward direction because more information has been lost. In many cases, back transliteration returns an almost exact result that differs only slightly in spelling from the original word form. In this work we propose a lexical word similarity measure for dictionary matching, used to re-rank the candidates and improve the performance of grapheme-based back transliteration of location names. The method is evaluated on the Vietnamese-English language pair and shows an improvement.
Keywords :
natural language processing; text analysis; Vietnamese-English language pair; Vietnamese-English named entity back transliteration; dictionary matching; forward transliteration process; grapheme-based location name back transliteration; language reranking; lexical word similarity; pronunciation; Accuracy; Computational linguistics; Dictionaries; Indexes; Measurement; Training data; USA Councils; Named entity; back transliteration; re-ranking; word similarity;
Conference_Title :
Asian Language Processing (IALP), 2011 International Conference on
Conference_Location :
Penang
Print_ISBN :
978-1-4577-1733-8
DOI :
10.1109/IALP.2011.44