DocumentCode :
2514333
Title :
On the Use of Median String for Multi-source Translation
Author :
González-Rubio, Jesús ; Casacuberta, Francisco
Author_Institution :
Dept. de Sist. Informaticos y Comput., Univ. Politec. de Valencia, Valencia, Spain
fYear :
2010
fDate :
23-26 Aug. 2010
Firstpage :
4328
Lastpage :
4331
Abstract :
State-of-the-art approaches to multi-source translation involve a multimodal-like process which applies an individual translation system to each source language. Then, the translations of the individual systems are combined to obtain a consensus output. We propose to use the (generalised) median string as the consensus output of the individual translation systems. Different approximations to the median string are studied as well as different approaches to improve the median string performance when dealing with natural language strings. The proposed approaches were evaluated on the Europarl corpus, achieving significant improvements in translation quality.
Keywords :
language translation; natural language processing; Europarl corpus; median string; multimodal-like process; multisource translation system; natural language strings; source language; Approximation algorithms; Approximation methods; Conferences; Error analysis; Pattern recognition; Training; Vocabulary; Multi-source machine translation; mean string;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
ISSN :
1051-4651
Print_ISBN :
978-1-4244-7542-1
Type :
conf
DOI :
10.1109/ICPR.2010.1052
Filename :
5597774
Link To Document :
بازگشت