DocumentCode :
323797
Title :
Transcription of broadcast news-some recent improvements to IBM´s LVCSR system
Author :
Polymenakos, L. ; Olsen, P. ; Kanvesky, D. ; Gopinath, R.A. ; Gopalakrishnan, P.S. ; Chen, S.
Author_Institution :
Dept. of Comput. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
901
Abstract :
This paper describes extensions and improvements to IBM´s large vocabulary continuous speech recognition (LVCSR) system for transcription of broadcast news. The recognizer uses an additional 35 hours of training data over the one used in the 1996 Hub4 evaluation. It includes a number of new features: optimal feature space for acoustic modeling (in training and/or testing), filler-word modeling, Bayesian information criterion (BIC) based segment clustering, an improved implementation of iterative MLLR and 4-gram language models. Results using the 1996 DARPA Hub4 evaluation data set are presented
Keywords :
Bayes methods; acoustic signal processing; broadcasting; grammars; information theory; speech recognition; speech synthesis; 4-gram language models; Bayesian information criterion; DARPA Hub4 evaluation data set; IBM LVCSR system; acoustic modeling; broadcast news transcription; filler-word modeling; iterative MLLR; large vocabulary continuous speech recognition; optimal feature space; segment clustering; testing; training; training data; Acoustic testing; Bandwidth; Broadcasting; Maximum likelihood linear regression; Speech analysis; Speech enhancement; Speech recognition; Telephony; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675411
Filename :
675411
Link To Document :
بازگشت