DocumentCode :
2261564
Title :
Discrete-utterance recognition with a fast match based on total data reduction
Author :
Nouza, Jan
Author_Institution :
Dept. of Electron. & Signal Process., Tech. Univ. of Liberec, Czech Republic
Volume :
4
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
2107
Abstract :
A two-level classification scheme that is applicable to practical discrete-utterance recognition systems is presented. Both the fast and fine matching employ CDHMM (continuous-density hidden Markov model) whole-word models. The fast match is based on total data reduction, which includes both the minimization of the acoustic data flow (the numbers of speech frames and features) and the reduction of the basic HMM parameters (the numbers of states and mixtures). The optimal choice of the fast match parameters is a subject of the procedure that aims at minimizing the total classification time while preserving the maximum available recognition accuracy. On a medium-size vocabulary task (121 city names), the fast match reduced the recognition time to approximately 20% (compared with the original one-level system), with a negligible loss of accuracy. The time savings were even more considerable in the case of a system with multi-mixture HMMs
Keywords :
acoustics; data reduction; hidden Markov models; minimisation; pattern classification; pattern matching; speech recognition; vocabulary; acoustic data flow minimization; city names; classification time minimization; continuous density hidden Markov model; discrete-utterance recognition systems; fast match; fine match; maximum available recognition accuracy; medium-size vocabulary task; mixtures; model parameter reduction; multi-mixture HMMs; recognition time; speech features; speech frames; states; time savings; total data reduction; two-level classification scheme; whole-word models; Cities and towns; Classification algorithms; Hidden Markov models; Laboratories; Prototypes; Signal processing; Speech analysis; Speech recognition; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607218
Filename :
607218
Link To Document :
بازگشت