Title :
Acoustic modelling of subword units in the Isadora speech recognizer
Author :
Schukat-Talamazzini, E.G. ; Niemann, H. ; Eckert, W. ; Kuhn, T. ; Rieck, S.
Author_Institution :
Lehrstuhl fuer Inf., Erlangen Univ., Germany
Abstract :
The authors address the choice of suitable subword units for the hidden Markov model (HMM)-based front-end of a speaker-independent large vocabulary continuous speech dialog system (EVAR). In contrast to the well-known approach of using context-dependent phone-like units (for instance generalized triphones) the authors developed inventories of larger-sized subword units, so-called context-freezing units (CFU). CFU models can be considered as an approximation to the extremely desirable situation of having whole word HMMs under the limiting conditions of the training speech data at hand. Recognition experiments indicate an advantage of the context-freezing units over triphone/biphone/phone combinations in terms of the achieved word accuracy, at least in the case of German speech. Using triphones with contexts generalized by means of broad phonetic classes, the authors achieved results comparable to the CFU ones
Keywords :
hidden Markov models; speech recognition; speech recognition equipment; EVAR; German speech; Isadora speech recognizer; acoustic modelling; context-freezing units; continuous speech dialog system; generalized triphones; hidden Markov model; speaker independent recognition; subword units; training speech data; word accuracy; Acoustic devices; Contracts; Feature extraction; Frequency; Hidden Markov models; Parameter estimation; Speech recognition; Stability; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225843