DocumentCode :
600225
Title :
Machine Transliteration Based on Error-Driven Learning
Author :
Ying Qin
Author_Institution :
Dept. of Comput. Sci., Beijing Foreign Studies Univ., Beijing, China
fYear :
2012
fDate :
13-15 Nov. 2012
Firstpage :
205
Lastpage :
208
Abstract :
Transliteration is a common translation method when named entities are introduced into another language. Direct orthographical mapping (DOM) approach is successfully applied in machine transliteration by segmenting a word according to syllables and then mapping them directly into target language without considering its pronunciation. The paper studies the performance of two-stage machine transliteration based on Conditional Random Fields. To reduce the amount of computation in model training, we propose an error-driven learning by dividing the training data into several groups and training the transliteration model step by step based on the error prediction data until the performance doesnât increase or the limitation of the computer. Experiments on data of NEWS2011 show that error-driven model training reduces computational complexity and saves the time of model training. Compared to the combining transliteration model, our transliteration system increases the accuracy of top-1 output with 0.06, reaching 0.652.
Keywords :
computational complexity; information retrieval; language translation; learning (artificial intelligence); natural language processing; random processes; DOM; computational complexity reduction; conditional random fields; cross-lingual information retrieval; direct orthographical mapping; error-driven learning; error-driven model training; translation method; two-stage machine transliteration; word segmentation; Accuracy; Computational modeling; Conferences; Data models; Testing; Training; Training data; Conditional Random Fields; Error-driven; Machine transliteration; Model training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location :
Hanoi
Print_ISBN :
978-1-4673-6113-2
Electronic_ISBN :
978-0-7695-4886-9
Type :
conf
DOI :
10.1109/IALP.2012.48
Filename :
6473732
Link To Document :
بازگشت