Title :
On estimating robust probability distribution in HMM-based speech recognition
Author :
Kim, Nam Soo ; Un, Chong Kwan
Author_Institution :
Samsung Adv. Inst. of Technol., South Korea
fDate :
7/1/1995 12:00:00 AM
Abstract :
We present various methods for estimating a robust output probability distribution (PD) in speech recognition based on the discrete hidden Markov model (HMM). In speech recognition, we encounter the problem of an insufficient amount of training data, which may cause inaccurate modeling of the HMM parameters, especially the output PD´s. In this paper, to enhance the robustness of the output PD´s with respect to unseen data, we study two approaches: smoothing and tying of the PD´s. We introduce a new algorithm to smooth a PD, where a smoothing matrix is estimated by following the strategy of cross-validation as used in deleted interpolation. As for tying, we derive a number of state classes based on a clustering tree which achieves a good compromise between robustness and detail of the tied PD´s in specifying speech feature characteristics. In addition to providing an efficient method for constructing the clustering tree, we suggest a measure that accounts for the variation of estimated PD´s under various situations. The performances of the proposed methods are evaluated by speaker-independent isolated word recognition experiments and are shown to be better in recognition accuracy than that of the PD´s based on the maximum likelihood criterion
Keywords :
hidden Markov models; probability; speech recognition; HMM-based speech recognition; algorithm; clustering tree; estimating robust probability distribution; maximum likelihood criterion; modeling; smoothing matrix; speaker-independent isolated word recognition experiments; speech feature characteristics; state classes; training data; Clustering algorithms; Hidden Markov models; Interpolation; Maximum likelihood estimation; Performance evaluation; Probability distribution; Robustness; Smoothing methods; Speech recognition; Training data;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on