DocumentCode :
290087
Title :
Learning state-dependent stream weights for multi-codebook HMM speech recognition systems
Author :
Rogina, I. ; Waibel, A.
Author_Institution :
Karlsruhe Univ., Germany
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
Many speech recognition systems use multiple information streams to compute HMM output probabilities (e.g. systems based on semicontinuous or discrete HMMs use one codebook for cepstral coefficients, and another one for delta cepstral coefficients). The final score is a weighted sum of the contributions of every stream. These weights can be found empirically and usually the same set of weights is used for every acoustic model. There is reason to believe that there are features which are more important for some acoustic models than for others. Especially one would expect the beginning and ending segment of a phoneme to be more context dependent than the middle part, so in that case the probability estimator of the speech recognizer should put more emphasis on the delta-spectrum than on the spectrum. Experiments have shown that spectral or cepstral coefficients are more important than their derivatives and more important than power or delta-power coefficients. We propose an algorithm for learning individual stream weights for every HMM state. Since these individual weights are a superset of the stream-only dependent weights, they can reproduce the results of the stream-only dependent weights and, additionally, discriminate between HMM states. Thus, the recognition performance must improve
Keywords :
hidden Markov models; probability; spectral analysis; speech coding; speech recognition; HMM output probabilities; acoustic model; algorithm; cepstral coefficients; delta cepstral coefficients; delta-power coefficients; delta-spectrum; learning; multi-codebook HMM; multiple information streams; phoneme; probability estimator; recognition performance; spectral coefficients; speech coding; speech recognition systems; state-dependent stream weights; stream-only dependent weights; Cepstral analysis; Hidden Markov models; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389316
Filename :
389316
Link To Document :
بازگشت