مرکز منطقه ای اطلاع رساني علوم و فناوري - On estimating robust probability distribution in HMM-based speech recognition

DocumentCode :

807618

Title :

On estimating robust probability distribution in HMM-based speech recognition

Author :

Kim, Nam Soo ; Un, Chong Kwan

Author_Institution :

Samsung Adv. Inst. of Technol., South Korea

Volume :

Issue :

fYear :

1995

fDate :

7/1/1995 12:00:00 AM

Firstpage :

279

Lastpage :

285

Abstract :

We present various methods for estimating a robust output probability distribution (PD) in speech recognition based on the discrete hidden Markov model (HMM). In speech recognition, we encounter the problem of an insufficient amount of training data, which may cause inaccurate modeling of the HMM parameters, especially the output PD´s. In this paper, to enhance the robustness of the output PD´s with respect to unseen data, we study two approaches: smoothing and tying of the PD´s. We introduce a new algorithm to smooth a PD, where a smoothing matrix is estimated by following the strategy of cross-validation as used in deleted interpolation. As for tying, we derive a number of state classes based on a clustering tree which achieves a good compromise between robustness and detail of the tied PD´s in specifying speech feature characteristics. In addition to providing an efficient method for constructing the clustering tree, we suggest a measure that accounts for the variation of estimated PD´s under various situations. The performances of the proposed methods are evaluated by speaker-independent isolated word recognition experiments and are shown to be better in recognition accuracy than that of the PD´s based on the maximum likelihood criterion

Keywords :

hidden Markov models; probability; speech recognition; HMM-based speech recognition; algorithm; clustering tree; estimating robust probability distribution; maximum likelihood criterion; modeling; smoothing matrix; speaker-independent isolated word recognition experiments; speech feature characteristics; state classes; training data; Clustering algorithms; Hidden Markov models; Interpolation; Maximum likelihood estimation; Performance evaluation; Probability distribution; Robustness; Smoothing methods; Speech recognition; Training data;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.397092

Filename :

397092

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=807618