مرکز منطقه ای اطلاع رساني علوم و فناوري - Dictionary-based discriminative HMM parameter estimation for continuous speech recognition systems

DocumentCode :

310667

Title :

Dictionary-based discriminative HMM parameter estimation for continuous speech recognition systems

Author :

Willett, Daniel ; Neukirchen, Christoph ; Rottland, Jörg

Author_Institution :

Dept. of Comput. Sci., Gerhard-Mercator-Univ., Duisburg, Germany

Volume :

fYear :

1997

fDate :

21-24 Apr 1997

Firstpage :

1515

Abstract :

The estimation of the HMM parameters has always been a major issue in the design of speech recognition systems. Discriminative objectives like maximum mutual information (MMI) or minimum classification error (MCE) have proved to be superior over the common maximum likelihood estimation (MLE) in cases where a robust estimation of the probabilistic density functions (PDFs) is not possible. The determination of the overall likelihood of an acoustic observation is the most crucial point of the MMI-parameter estimation when applied to continuous speech systems. Contrary to the common approaches that estimate the overall likelihood of the training observations by evaluating the most confusing sentences or by applying global state frequencies, this paper suggests a dictionary analysis in order to get estimates for the dictionary-based risk of mixing two HMM states. These estimates are used to estimate the observations´ likelihood and to control the discriminative MMI training procedure. Results on a monophone SCHMM speech recognition system are presented that prove the practicability of the new approach

Keywords :

acoustic signal processing; hidden Markov models; parameter estimation; speech processing; speech recognition; HMM parameter estimation; HMM states mixing; MLE; acoustic observation; continuous speech recognition systems; dictionary analysis; dictionary based discriminative HMM; dictionary based risk; discriminative MMI training; discriminative language dependent model; discriminative objectives; maximum likelihood estimation; maximum mutual information; minimum classification error; monophone SCHMM; training observations; Density functional theory; Dictionaries; Frequency estimation; Hidden Markov models; Maximum likelihood estimation; Mutual information; Parameter estimation; Robustness; Speech recognition; State estimation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location :

Munich

ISSN :

1520-6149

Print_ISBN :

0-8186-7919-0

Type :

conf

DOI :

10.1109/ICASSP.1997.596238

Filename :

596238

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=310667