مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimising hidden Markov models using discriminative output distributions

DocumentCode :

1924245

Title :

Optimising hidden Markov models using discriminative output distributions

Author :

Woodland, Philip C. ; Cole, David R.

Author_Institution :

Dept. of Eng., Cambridge Univ., UK

fYear :

1991

fDate :

14-17 Apr 1991

Firstpage :

545

Abstract :

Models similar to Doddington´s (1989, 1990) hidden Markov models (HMMs) that use phonetically sensitive discriminants are discussed. In this style of HMM, each state models a subspace of the overall acoustic vector; the subspace is chosen to increase discrimination between the in-class and potentially confusable out-of-class utterances. The theoretical basis is presented and various aspects of using these models are discussed, such as the method of gathering confusion statistics; obtaining the correct normalization for the subspace Gaussian distribution and the effects of this term; and the computational requirements for the method. A large number of experiments on a 104 talker British English E-set database were performed that illustrate the utility of the method on a difficult speech recognition task. The experiments give a best speaker-independent error rate 7.9%, and a best multiple speaker error rate of 3.8%

Keywords :

Markov processes; speech recognition; state-space methods; British English E-set database; Gaussian distribution; HMM; acoustic vector; confusion statistics; discriminative output distributions; hidden Markov models; in-class utterances; multiple speaker error rate; optimisation; out-of-class utterances; phonetically sensitive discriminants; speaker-independent error rate; speech recognition; state models; subspace; Acoustical engineering; Covariance matrix; Databases; Distributed computing; Eigenvalues and eigenfunctions; Error analysis; Hidden Markov models; Probability distribution; Speech recognition; Statistical distributions;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location :

Toronto, Ont.

ISSN :

1520-6149

Print_ISBN :

0-7803-0003-3

Type :

conf

DOI :

10.1109/ICASSP.1991.150397

Filename :

150397

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1924245