DocumentCode :
3492236
Title :
Segmental intensity and HMM modeling
Author :
Dumouchel, P. ; Shaughnessy, D. O´
Author_Institution :
INRS-Telecommun., Quebec Univ., Verdun, Que., Canada
Volume :
2
fYear :
1995
fDate :
5-8 Sep 1995
Firstpage :
995
Abstract :
We propose to use a stochastic segmental intensity model independent of the HMM model in INRS´s large vocabulary continuous speech recognizer. First, we examine how to insert this model into the search algorithm without violating the optimality constraints of this algorithm. Second, we propose and test the performance of four different intensity models. The training and testing of the models is done on a studio quality speaker-dependent speech corpus. The first model is a Gaussian mixture phone intensity model independent of the phonemic context. The second model is a Gaussian mixture phone intensity model dependent on the right or left phoneme context. The third model is a Gaussian mixture intensity model based on the variation of intensity within a diphone. Finally, the last model consists of a stochastic silence-speech detector. Performance comparisons show that the best model uses Gaussian mixture of the variation of intensity within a diphone (third model). This model improves the percentage of word recognition from 89.58% (no intensity modeling) to 90.92%
Keywords :
Gaussian processes; hidden Markov models; speech recognition; Gaussian mixture phone intensity model; HMM model; continuous speech recognition; diphone; large vocabulary; optimality constraints; phoneme context; search algorithm; stochastic segmental intensity model; stochastic silence-speech detector; studio quality speaker-dependent speech corpus; testing; training; word recognition; Automata; Business; Context modeling; Detectors; Hidden Markov models; Speech recognition; Stochastic processes; Stress; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering, 1995. Canadian Conference on
Conference_Location :
Montreal, Que.
ISSN :
0840-7789
Print_ISBN :
0-7803-2766-7
Type :
conf
DOI :
10.1109/CCECE.1995.526596
Filename :
526596
Link To Document :
بازگشت