Title :
Optimising hidden Markov models using discriminative output distributions
Author :
Woodland, Philip C. ; Cole, David R.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
Models similar to Doddington´s (1989, 1990) hidden Markov models (HMMs) that use phonetically sensitive discriminants are discussed. In this style of HMM, each state models a subspace of the overall acoustic vector; the subspace is chosen to increase discrimination between the in-class and potentially confusable out-of-class utterances. The theoretical basis is presented and various aspects of using these models are discussed, such as the method of gathering confusion statistics; obtaining the correct normalization for the subspace Gaussian distribution and the effects of this term; and the computational requirements for the method. A large number of experiments on a 104 talker British English E-set database were performed that illustrate the utility of the method on a difficult speech recognition task. The experiments give a best speaker-independent error rate 7.9%, and a best multiple speaker error rate of 3.8%
Keywords :
Markov processes; speech recognition; state-space methods; British English E-set database; Gaussian distribution; HMM; acoustic vector; confusion statistics; discriminative output distributions; hidden Markov models; in-class utterances; multiple speaker error rate; optimisation; out-of-class utterances; phonetically sensitive discriminants; speaker-independent error rate; speech recognition; state models; subspace; Acoustical engineering; Covariance matrix; Databases; Distributed computing; Eigenvalues and eigenfunctions; Error analysis; Hidden Markov models; Probability distribution; Speech recognition; Statistical distributions;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150397