Title :
Maximum likelihood weighting of dynamic speech features for CDHMM speech recognition
Author :
Hernando, Javier
Author_Institution :
Dept. de Teoria del Senyal i Comunicacions, Univ. Politecnica de Catalunya, Barcelona, Spain
Abstract :
Speech dynamic features are routinely used in current speech recognition systems in combination with short-term (static) spectral features. Although many existing speech recognition systems do not weight both kinds of features, it seems convenient to use some weighting in order to increase the recognition accuracy of the system. In the cases that this weighting is performed, it is manually tuned or it consists simply in compensating the variances. The aim of this paper is to propose a method to automatically estimate an optimum state-dependent stream weighting in a continuous density hidden Markov model (CDHMM) recognition system by means of a maximum-likelihood based training algorithm. Unlike other works, it is shown that simple constraints on the new weighting parameters permit to apply the maximum-likelihood criterion to this problem. Experimental results in speaker independent digit recognition show an important increase of recognition accuracy
Keywords :
feature extraction; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; CDHMM; HMM; continuous density hidden Markov model; dynamic speech features; maximum likelihood weighting; maximum-likelihood based training algorithm; optimum state-dependent stream weighting; recognition accuracy; speaker independent digit recognition; speech recognition; weighting parameters; Automatic speech recognition; Error analysis; Hidden Markov models; Maximum likelihood estimation; Probability; Speech recognition; State estimation;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596176