Title :
Linear dynamic segmental HMMs: variability representation and training procedure
Author :
Holmes, Wendy J. ; Russell, Martin R.
Author_Institution :
Speech Res. Unit, DRA, Malvern, UK
Abstract :
This paper describes investigations into the use of linear dynamic segmental hidden Markov models (SHMMs) for modelling speech feature-vector trajectories and their associated variability. These models use linear trajectories to describe how features change over time, and distinguish between extra-segmental variability of different trajectories and intra-segmental variability of individual observations around any one trajectory. Analyses of mel cepstrum features have indicated that a linear trajectory is a reasonable approximation when using models with three states per phone. Good recognition performance has been demonstrated with linear SHMMs. This performance is, however, dependent on the model initialisation and training strategy, and on representing the distributions accurately according to the model assumptions
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; extra-segmental variability; intra-segmental variability; linear dynamic segmental hidden Markov models; linear trajectory; mel cepstrum features; speech feature-vector trajectories; speech models; speech recognition; training procedure; variability representation; Cepstral analysis; Cepstrum; Covariance matrix; Data analysis; Gaussian distribution; Hidden Markov models; Speech analysis; State estimation; Trajectory; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596209