Title :
Speech trajectory discrimination using the minimum classification error learning
Author :
Chengalvarayan, Rathinavelu ; Deng, Li
Author_Institution :
Dept. of Electr. & Comput. Eng., Waterloo Univ., Ont., Canada
fDate :
11/1/1998 12:00:00 AM
Abstract :
In this paper, we extend the maximum likelihood (ML) training algorithm to the minimum classification error (MCE) training algorithm for discriminatively estimating the state-dependent polynomial coefficients in the stochastic trajectory model or the trended hidden Markov model (HMM) originally proposed in Deng (1992). The main motivation of this extension is the new model space for smoothness-constrained, state-bound speech trajectories associated with the trended HMM, contrasting the conventional, stationary-state HMM, which describes only the piecewise-constant “degraded trajectories” in the observation data. The discriminative training implemented for the trended HMM has the potential to utilize this new, constrained model space, thereby providing stronger power to disambiguate the observational trajectories generated from nonstationary sources corresponding to different speech classes. Phonetic classification results are reported which demonstrate consistent performance improvements with use of the MCE-trained trended HMM both over the regular ML-trained trended HMM and over the MCE-trained stationary-state HMM
Keywords :
hidden Markov models; maximum likelihood estimation; pattern classification; speech recognition; HMM; MCE-trained trended HMM; maximum likelihood training algorithm; minimum classification error learning; phonetic classification; piecewise-constant degraded trajectories; smoothness-constrained state-bound speech trajectories; speech class; speech trajectory discrimination; state-dependent polynomial coefficients; stochastic trajectory model; trended hidden Markov model; Helium; Hidden Markov models; Maximum likelihood estimation; Polynomials; Power generation; Space stations; Speech analysis; Speech recognition; State estimation; Stochastic processes;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on