DocumentCode :
2175076
Title :
The IBM 2009 GALE Arabic speech transcription system
Author :
Kingsbury, Brian ; Soltau, Hagen ; Saon, George ; Chu, Stephen ; Kuo, Hong-Kwang ; Mangu, Lidia ; Ravuri, Suman ; Morgan, Nelson ; Janin, Adam
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4672
Lastpage :
4675
Abstract :
We describe the Arabic broadcast transcription system fielded by IBM in the GALE Phase 4 machine translation evaluation. Key advances over our Phase 3.5 system include improvements to context-dependent modeling in vowelized Arabic acoustic models; the use of neural-network features provided by the International Computer Science Institute; Model M language models; a neural network language model that uses syntactic and morphological features; and improvements to our system combination strategy. These advances were instrumental in achieving a word error rate of 8.9% on the Phase 4 evaluation set, and an absolute improvement of 1.6% word error rate over our 2008 system on the unsequestered Phase 3.5 evaluation data.
Keywords :
IBM compatible machines; language translation; neural nets; speech recognition; GALE phase 4 machine translation evaluation; IBM 2009 GALE Arabic speech transcription system; International Computer Science Institute; context-dependent modeling; model M language model; morphological feature; neural network language model; syntactic feature; vowelized Arabic acoustic model; word error rate; Acoustics; Artificial neural networks; Computational modeling; Context modeling; Error analysis; Hidden Markov models; Syntactics; large vocabulary speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947397
Filename :
5947397
Link To Document :
بازگشت