Title :
Integrating Speech Recognition and Machine Translation
Author :
Matsoukas, Spyros ; Bulyko, Ivan ; Xiang, Bing ; Nguyen, Kham ; Schwartz, R. ; Makhoul, John
Author_Institution :
BBN Technologies, Cambridge, MA, USA
Abstract :
This paper presents a set of experiments that we conducted in order to optimize the performance of an Arabic/English machine translation system on broadcast news and conversational speech data. Proper integration of speech-to-text (STT) and machine translation (MT) requires special attention to issues such as sentence boundary detection, punctuation, STT accuracy, tokenization, conversion of spoken numbers and dates to written form, optimization of MT decoding weights, and scoring. We discuss these issues, and show that a carefully tuned STT/MT integration can lead to significant translation accuracy improvements compared to simply feeding the regular STT output to a text MT system.
Keywords :
language translation; speech recognition; Arabic-English machine translation system; MT decoding weights; broadcast news; conversational speech data; sentence boundary detection; speech-to-text; Broadcast technology; Broadcasting; Contracts; Decoding; Educational institutions; Information science; Lattices; Loudspeakers; Pipelines; Machine Translation
Conference_Title :
2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007)
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367311