DocumentCode :
3427631
Title :
Developing high performance asr in the IBM multilingual speech-to-speech translation system
Author :
Cui, Xiaodong ; Gu, Liang ; Xiang, Bing ; Zhang, Wei ; Gao, Yuqing
Author_Institution :
T. J. Watson Res. Center, IBM, Yorktown Heights, NY
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
5121
Lastpage :
5124
Abstract :
This paper presents our recent development of the real-time speech recognition component in the IBM English/Iraqi Arabic speech-to-speech translation system for the DARPA Transtac project. We describe the details of the acoustic and language modeling that lead to high recognition accuracy and noise robustness and give the performance of the system on the evaluation sets of spontaneous conversational speech. We also introduce the streaming decoding structure and several speedup techniques that achieves best recognition accuracy at about 0.3 x RT recognition speed.
Keywords :
decoding; natural languages; speech recognition; IBM multilingual speech translation; acoustic modeling; language modeling; large vocabulary spontaneous speech recognition; noise robustness; streaming mode decoding; Acoustic noise; Automatic speech recognition; Decoding; Hidden Markov models; Linear discriminant analysis; Natural languages; Noise robustness; Real time systems; Speech enhancement; Speech recognition; discriminative training; large vocabulary spontaneous speech recognition; multilingual speech translation; noise robustness; streaming mode decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518811
Filename :
4518811
Link To Document :
بازگشت