Title :
Segmental intensity and HMM modeling
Author :
Dumouchel, P. ; Shaughnessy, D. O´
Author_Institution :
INRS-Telecommun., Quebec Univ., Verdun, Que., Canada
Abstract :
We propose to use a stochastic segmental intensity model independent of the HMM model in INRS´s large vocabulary continuous speech recognizer. First, we examine how to insert this model into the search algorithm without violating the optimality constraints of this algorithm. Second, we propose and test the performance of four different intensity models. The training and testing of the models is done on a studio quality speaker-dependent speech corpus. The first model is a Gaussian mixture phone intensity model independent of the phonemic context. The second model is a Gaussian mixture phone intensity model dependent on the right or left phoneme context. The third model is a Gaussian mixture intensity model based on the variation of intensity within a diphone. Finally, the last model consists of a stochastic silence-speech detector. Performance comparisons show that the best model uses Gaussian mixture of the variation of intensity within a diphone (third model). This model improves the percentage of word recognition from 89.58% (no intensity modeling) to 90.92%
Keywords :
Gaussian processes; hidden Markov models; speech recognition; Gaussian mixture phone intensity model; HMM model; continuous speech recognition; diphone; large vocabulary; optimality constraints; phoneme context; search algorithm; stochastic segmental intensity model; stochastic silence-speech detector; studio quality speaker-dependent speech corpus; testing; training; word recognition; Automata; Business; Context modeling; Detectors; Hidden Markov models; Speech recognition; Stochastic processes; Stress; Testing; Vocabulary;
Conference_Titel :
Electrical and Computer Engineering, 1995. Canadian Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7803-2766-7
DOI :
10.1109/CCECE.1995.526596