DocumentCode
629557
Title
The feasibility analysis of re-ranking for N-best lists on English-Turkish machine translation
Author
Yildirim, E. ; Tantug, Ahmet Cuneyd
Author_Institution
Dept. of Comput. Eng., Istanbul Tech. Univ., Istanbul, Turkey
fYear
2013
fDate
19-21 June 2013
Firstpage
1
Lastpage
5
Abstract
In this paper, we present the results of re-ranking N-best lists in machine translation. The main purpose of this research is to determine the upper bound on the MT improvement that can be gained by reordering candidate translations. We use the Google Translate Research API as our Statistical Machine Translation (SMT) system to obtain N-best lists of possible Turkish translations for a given English sentence. We evaluate the effect of reordering using three simple methods: unigram count (UC), unigram ratio (UR), and first four characters match (FFCM). We collected 720 sentences as input to the SMT system and used 3 different sets of their Turkish translations to evaluate our work on the N-best lists. The success of re-ranking is measured with the BLEU metric; in addition, a more comprehensive investigation, which is necessary especially for agglutinative languages (e.g. Turkish, Czech, Hungarian, and Finnish), is performed using the BLEU+ MT scoring tool. We observe an improvement in BLEU score from 31.71 for the baseline system to 35.46 for the model re-ranked using UR, a relative gain of about 11.81%.
Keywords
information retrieval; language translation; natural language processing; statistical analysis; API; BLEU metric; BLEU+ MT scoring tool; English-Turkish machine translation; FFCM; Google Translate Research; N-best list; SMT system; agglutinative language; baseline system; candidate translation reordering; first four characters match; reranking feasibility analysis; statistical machine translation; unigram count; unigram ratio; Algorithm design and analysis; Computational modeling; Computers; Educational institutions; Google; Measurement; Speech recognition; BLEU+; machine translation; n-best list; re-ranking;
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovations in Intelligent Systems and Applications (INISTA), 2013 IEEE International Symposium on
Conference_Location
Albena
Print_ISBN
978-1-4799-0659-8
Type
conf
DOI
10.1109/INISTA.2013.6577652
Filename
6577652