مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning state-dependent stream weights for multi-codebook HMM speech recognition systems

DocumentCode :

290087

Title :

Learning state-dependent stream weights for multi-codebook HMM speech recognition systems

Author :

Rogina, I. ; Waibel, A.

Author_Institution :

Karlsruhe Univ., Germany

Volume :

fYear :

1994

fDate :

19-22 Apr 1994

Abstract :

Many speech recognition systems use multiple information streams to compute HMM output probabilities (e.g. systems based on semicontinuous or discrete HMMs use one codebook for cepstral coefficients, and another one for delta cepstral coefficients). The final score is a weighted sum of the contributions of every stream. These weights can be found empirically and usually the same set of weights is used for every acoustic model. There is reason to believe that there are features which are more important for some acoustic models than for others. Especially one would expect the beginning and ending segment of a phoneme to be more context dependent than the middle part, so in that case the probability estimator of the speech recognizer should put more emphasis on the delta-spectrum than on the spectrum. Experiments have shown that spectral or cepstral coefficients are more important than their derivatives and more important than power or delta-power coefficients. We propose an algorithm for learning individual stream weights for every HMM state. Since these individual weights are a superset of the stream-only dependent weights, they can reproduce the results of the stream-only dependent weights and, additionally, discriminate between HMM states. Thus, the recognition performance must improve

Keywords :

hidden Markov models; probability; spectral analysis; speech coding; speech recognition; HMM output probabilities; acoustic model; algorithm; cepstral coefficients; delta cepstral coefficients; delta-power coefficients; delta-spectrum; learning; multi-codebook HMM; multiple information streams; phoneme; probability estimator; recognition performance; spectral coefficients; speech coding; speech recognition systems; state-dependent stream weights; stream-only dependent weights; Cepstral analysis; Hidden Markov models; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location :

Adelaide, SA

ISSN :

1520-6149

Print_ISBN :

0-7803-1775-0

Type :

conf

DOI :

10.1109/ICASSP.1994.389316

Filename :

389316

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=290087