DocumentCode :
3229324
Title :
Use of semi-Markov models for speaker-independent phoneme recognition
Author :
Ratnayake, Nimal ; Savic, Michael ; Sorensen, Jeflrey
Author_Institution :
Dept. of Electr., Comput., & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
565
Abstract :
Hidden Markov models (HMMs) have been used to model speech in many areas of speech processing. One characteristic of the HMM is that the probability of time spent in a particular state, or state occupancy, is geometrically distributed. This, however, becomes a serious limitation and results in inaccurate modeling when the HMMs are used for phoneme recognition. The authors use hidden semi-Markov models (HSMM) to overcome the above limitation. Semi-Markov models are a more general class of Markov chains in which the state occupancy can be explicitly modeled by an arbitrary probability mass distribution. The authors use non-parametric distributions to describe the state occupancies instead of parametric distributions such as gamma. Poisson or binomial, as analysis of actual data shows that the duration of some phonemes could not be approximated by any of the above. Preliminary tests conducted using only the linear prediction coding (LPC) cepstrum as features have shown that the use of HSMM increased the phoneme recognition accuracy to 53.7% from the 48.4% obtained using an HMM
Keywords :
hidden Markov models; linear predictive coding; speech analysis and processing; speech coding; speech recognition; HMM; LPC cepstrum; Markov chains; linear prediction coding; probability mass distribution; semi Markov models; speaker-independent phoneme recognition; speech processing; state occupancy; Cepstral analysis; Clocks; Data analysis; Hidden Markov models; Solid modeling; Speech analysis; Speech processing; Stochastic processes; Systems engineering and theory; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225845
Filename :
225845
Link To Document :
بازگشت