DocumentCode
600225
Title
Machine Transliteration Based on Error-Driven Learning
Author
Ying Qin
Author_Institution
Dept. of Comput. Sci., Beijing Foreign Studies Univ., Beijing, China
fYear
2012
fDate
13-15 Nov. 2012
Firstpage
205
Lastpage
208
Abstract
Transliteration is a common translation method when named entities are introduced into another language. Direct orthographical mapping (DOM) approach is successfully applied in machine transliteration by segmenting a word according to syllables and then mapping them directly into target language without considering its pronunciation. The paper studies the performance of two-stage machine transliteration based on Conditional Random Fields. To reduce the amount of computation in model training, we propose an error-driven learning by dividing the training data into several groups and training the transliteration model step by step based on the error prediction data until the performance doesnât increase or the limitation of the computer. Experiments on data of NEWS2011 show that error-driven model training reduces computational complexity and saves the time of model training. Compared to the combining transliteration model, our transliteration system increases the accuracy of top-1 output with 0.06, reaching 0.652.
Keywords
computational complexity; information retrieval; language translation; learning (artificial intelligence); natural language processing; random processes; DOM; computational complexity reduction; conditional random fields; cross-lingual information retrieval; direct orthographical mapping; error-driven learning; error-driven model training; translation method; two-stage machine transliteration; word segmentation; Accuracy; Computational modeling; Conferences; Data models; Testing; Training; Training data; Conditional Random Fields; Error-driven; Machine transliteration; Model training;
fLanguage
English
Publisher
ieee
Conference_Titel
Asian Language Processing (IALP), 2012 International Conference on
Conference_Location
Hanoi
Print_ISBN
978-1-4673-6113-2
Electronic_ISBN
978-0-7695-4886-9
Type
conf
DOI
10.1109/IALP.2012.48
Filename
6473732
Link To Document