DocumentCode :
454520
Title :
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer
Author :
Oura, Keiichiro ; Zen, Heiga ; Nankaku, Yoshihiko ; Lee, Akinobu ; Tokuda, Keiichi
Author_Institution :
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol.
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
In hidden Markov models (HMMs), state duration probabilities decrease exponentially with time. It would be an inappropriate representation of temporal structure of speech. One of the solutions for this problem is integrating state duration probability distributions explicitly into the HMM. This form is known as a hidden semi-Markov model (HSMM). Although a number of attempts to use explicit duration models in speech recognition systems have been proposed, they are not consistent because various approximations were used in both training and decoding. In the present paper, a fully consistent speech recognition system based on the HSMM framework is proposed. In a speaker-dependent continuous speech recognition experiment, HSMM-based speech recognition system achieved about 5.9% relative error reduction over the corresponding HMM-based one
Keywords :
hidden Markov models; speech recognition; statistical distributions; HMM; hidden semi-Markov model; speech recognition system; state duration probability distributions; weighted finite-state transducer; Clustering algorithms; Computer science; Context modeling; Decoding; Hidden Markov models; Probability distribution; Speech recognition; State estimation; Statistical distributions; Transducers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1659950
Filename :
1659950
Link To Document :
بازگشت