DocumentCode :
290016
Title :
Improving speech recognition performance via phone-dependent VQ codebooks and adaptive language models in SPHINX-II
Author :
Hwang, M. ; Rosenfeld, R. ; Thayer, E. ; Mosur, R. ; Chase, L. ; Weide, R. ; Huang, X. ; Alleva, F.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
This paper presents improvements in acoustic and language modeling for automatic speech recognition. Specifically, semi-continuous HMMs (SCHMMs) with phone-dependent VQ codebooks are presented and incorporated into the SPHINX-II speech recognition system. The phone-dependent VQ codebooks relax the density-tying constraint in SCHMMs in order to obtain more detailed models. A 6% error rate reduction was achieved on the speaker-independent 20000-word Wall Street Journal (WSJ) task. Dynamic adaptation of the language model in the context of long documents is also explored. A maximum entropy framework is used to exploit long-distance trigrams and trigger effects. A 10%-15% word error rate reduction is reported on the same WSJ task using the adaptive language modeling technique.
Keywords :
hidden Markov models; maximum entropy methods; natural languages; speech coding; speech recognition; vector quantisation; SPHINX-II; Wall Street Journal; acoustic modeling; adaptive language modeling technique; adaptive language models; automatic speech recognition; density-tying constraint; dynamic adaptation; error rate reduction; language model; language modeling; long distance trigrams; long documents; maximum entropy framework; phone-dependent VQ codebooks; semi-continuous HMMs; speech recognition performance; trigger effects; Automatic speech recognition; Books; Computer science; Context modeling; Entropy; Error analysis; Hidden Markov models; Lattices; Natural languages; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389235
Filename :
389235