Title :
An improved training algorithm in HMM-based speech recognition
Author :
Li, Gongjun ; Huang, Taiyi
Author_Institution :
Nat. Lab. of Pattern Recognition, Acad. Sinica, Beijing, China
Abstract :
In HMM-based speech recognition estimation of parameters of HMMs is viewed as counterpart of training or learning in traditional sequential pattern recognition since speech signal can be represented by a sequence of n-dimension vectors after features are extracted from the speech signal. However, due to variation of duration of the phone with speakers and context and its randomness, speech samples contribute differently to estimation of parameters of HMMs. While only smaller training set is accessible, for instance, in the case of speaker adaptation, the problem becomes very serious. The authors analyze the impact of different duration of the phone on the output probability likelihood. To combat the above problem, two approaches an proposed to make proportionate the contribution of speech samples to estimation of parameters of HMM: geometrically averaged probability likelihood method and centralized parametric space method. Several experiments an conducted to verify the advantage of the above approaches in HMM-based speech recognition. The results show that the recognition rate can be improved to a certain degree when any one of the above approaches is employed
Keywords :
feature extraction; hidden Markov models; parameter estimation; probability; speech recognition; HMM-based speech recognition; centralized parametric space method; context; feature extraction; geometrically averaged probability likelihood method; improved training algorithm; learning; n-dimension vector sequence; output probability likelihood; parameter estimation; phone duration variation; recognition rate; sequential pattern recognition; speaker adaptation; speakers; speech samples; speech signal; speech signal representation; training; Automatic speech recognition; Hidden Markov models; Laboratories; Parameter estimation; Pattern recognition; Speech analysis; Speech recognition; Stochastic processes;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607787