DocumentCode
284587
Title
Acoustic modelling of subword units in the Isadora speech recognizer
Author
Schukat-Talamazzini, E.G. ; Niemann, H. ; Eckert, W. ; Kuhn, T. ; Rieck, S.
Author_Institution
Lehrstuhl fuer Inf., Erlangen Univ., Germany
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
577
Abstract
The authors address the choice of suitable subword units for the hidden Markov model (HMM)-based front-end of a speaker-independent large vocabulary continuous speech dialog system (EVAR). In contrast to the well-known approach of using context-dependent phone-like units (for instance generalized triphones) the authors developed inventories of larger-sized subword units, so-called context-freezing units (CFU). CFU models can be considered as an approximation to the extremely desirable situation of having whole word HMMs under the limiting conditions of the training speech data at hand. Recognition experiments indicate an advantage of the context-freezing units over triphone/biphone/phone combinations in terms of the achieved word accuracy, at least in the case of German speech. Using triphones with contexts generalized by means of broad phonetic classes, the authors achieved results comparable to the CFU ones
Keywords
hidden Markov models; speech recognition; speech recognition equipment; EVAR; German speech; Isadora speech recognizer; acoustic modelling; context-freezing units; continuous speech dialog system; generalized triphones; hidden Markov model; speaker independent recognition; subword units; training speech data; word accuracy; Acoustic devices; Contracts; Feature extraction; Frequency; Hidden Markov models; Parameter estimation; Speech recognition; Stability; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225843
Filename
225843
Link To Document