Title :
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer
Author :
Oura, Keiichiro ; Zen, Heiga ; Nankaku, Yoshihiko ; Lee, Akinobu ; Tokuda, Keiichi
Author_Institution :
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol.
Abstract :
In hidden Markov models (HMMs), state duration probabilities decrease exponentially with time, which is an inappropriate representation of the temporal structure of speech. One solution to this problem is to integrate state duration probability distributions explicitly into the HMM; the resulting model is known as a hidden semi-Markov model (HSMM). Although a number of attempts have been made to use explicit duration models in speech recognition systems, they are not consistent because various approximations were used in both training and decoding. In the present paper, a fully consistent speech recognition system based on the HSMM framework is proposed. In a speaker-dependent continuous speech recognition experiment, the HSMM-based speech recognition system achieved about 5.9% relative error reduction over the corresponding HMM-based one.
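The exponential decay mentioned in the abstract follows from the HMM self-transition: a state with self-transition probability a is occupied for exactly d frames with probability a^(d-1) * (1 - a), so the most likely duration is always one frame. The sketch below contrasts this with an explicit duration distribution of the kind an HSMM attaches to each state. The value a = 0.8 and the shifted Poisson with mean 4.5 are illustrative assumptions only, not the parameters or the distribution family used in the paper.

```python
import math


def hmm_duration_pmf(a_self, d):
    """Implicit HMM duration model: stay d - 1 times, then leave the state."""
    return a_self ** (d - 1) * (1.0 - a_self)


def explicit_duration_pmf(mean, d):
    """Hypothetical explicit duration model: a Poisson shifted to d = 1, 2, ..."""
    k = d - 1
    return math.exp(-mean) * mean ** k / math.factorial(k)


durations = range(1, 11)
hmm = [hmm_duration_pmf(0.8, d) for d in durations]            # assumed a = 0.8
explicit = [explicit_duration_pmf(4.5, d) for d in durations]  # assumed mean = 4.5

# Most probable duration under each model.
hmm_mode = max(durations, key=lambda d: hmm[d - 1])
explicit_mode = max(durations, key=lambda d: explicit[d - 1])

print("HMM mode:", hmm_mode)
print("explicit-duration mode:", explicit_mode)
```

Under these assumed values the implicit HMM distribution peaks at a single frame and falls monotonically, whereas the explicit distribution can place its peak at a longer, more speech-like duration.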
Keywords :
hidden Markov models; speech recognition; statistical distributions; HMM; hidden semi-Markov model; speech recognition system; state duration probability distributions; weighted finite-state transducer; Clustering algorithms; Computer science; Context modeling; Decoding; Hidden Markov models; Probability distribution; Speech recognition; State estimation; Statistical distributions; Transducers;
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1659950