DocumentCode :
2875675
Title :
Investigating translation of Parliament speeches
Author :
Dechelotte, D. ; Schwenk, H. ; Gauvain, J.-L. ; Galibert, O. ; Lamel, L.
Author_Institution :
LIMSI-CNRS, Orsay
fYear :
2005
fDate :
27-27 Nov. 2005
Firstpage :
116
Lastpage :
120
Abstract :
This paper reports on recent experiments for speech to text (STT) translation of European Parliamentary speeches. A Spanish speech to English text translation system has been built using data from the TC-STAR European project. The speech recognizer is a state-of-the-art multipass system trained for the Spanish EPPS task and the statistical translation system relies on the IBM-4 model. First, MT results are compared using manual transcriptions and 1-best ASR hypotheses with different word error rates. Then, a n-best interface between the ASR and MT components is investigated to improve the STT process. Derivation of the fundamental equation for machine translation suggests that the source language model is not necessary for STT. This was investigated by using weak source language models and by n-best rescoring adding the acoustic model score only. A significant loss in the BLEU score was observed suggesting that the source language model is needed given the insufficiencies of the translation model. Adding the source language model score in the n-best rescoring process recovers the loss and slightly improves the BLEU score over the 1-best ASR hypothesis. The system achieves a BLEU score of 37.3 with an ASR word error rate of 10% and a BLEU score of 40.5 using the manual transcripts
Keywords :
language translation; natural languages; speech recognition; speech synthesis; BLEU score; European Parliamentary speeches; machine translation; multipass system; source language model; speech recognizer; speech to text translation; Automatic speech recognition; Equations; Error analysis; Humans; Natural languages; Speech recognition; Speech synthesis; Statistics; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
Type :
conf
DOI :
10.1109/ASRU.2005.1566514
Filename :
1566514
Link To Document :
بازگشت