DocumentCode :
2705909
Title :
Integrating Speech Recognition and Machine Translation
Author :
Matsoukas, Spyros ; Bulyko, Ivan ; Bing Xiang ; Kham Nguyen ; Schwartz, R. ; Makhoul, John
Author_Institution :
BBN Technol., Cambridge, MA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper presents a set of experiments that we conducted in order to optimize the performance of an Arabic/English machine translation system on broadcast news and conversational speech data. Proper integration of speech-to-text (STT) and machine translation (MT) requires special attention to issues such as sentence boundary detection, punctuation, STT accuracy, tokenization, conversion of spoken numbers and dates to written form, optimization of MT decoding weights, and scoring. We discuss these issues, and show that a carefully tuned STT/MT integration can lead to significant translation accuracy improvements compared to simply feeding the regular STT output to a text MT system.
Keywords :
language translation; speech recognition; Arabic-English machine translation system; MT decoding weights; broadcast news; conversational speech data; sentence boundary detection; speech recognition; speech-to-text; Broadcast technology; Broadcasting; Contracts; Decoding; Educational institutions; Information science; Lattices; Loudspeakers; Pipelines; Speech recognition; Machine Translation; Sentence Boundary Detection; Speech Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367311
Filename :
4218342
Link To Document :
بازگشت