DocumentCode :
310644
Title :
Linear dynamic segmental HMMs: variability representation and training procedure
Author :
Holmes, Wendy J. ; Russell, Martin R.
Author_Institution :
Speech Res. Unit, DRA, Malvern, UK
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1399
Abstract :
This paper describes investigations into the use of linear dynamic segmental hidden Markov models (SHMMs) for modelling speech feature-vector trajectories and their associated variability. These models use linear trajectories to describe how features change over time, and distinguish between extra-segmental variability of different trajectories and intra-segmental variability of individual observations around any one trajectory. Analyses of mel cepstrum features have indicated that a linear trajectory is a reasonable approximation when using models with three states per phone. Good recognition performance has been demonstrated with linear SHMMs. This performance is, however, dependent on the model initialisation and training strategy, and on representing the distributions accurately according to the model assumptions
Keywords :
acoustic signal processing; cepstral analysis; feature extraction; hidden Markov models; speech processing; speech recognition; HMM; extra-segmental variability; intra-segmental variability; linear dynamic segmental hidden Markov models; linear trajectory; mel cepstrum features; speech feature-vector trajectories; speech models; speech recognition; training procedure; variability representation; Cepstral analysis; Cepstrum; Covariance matrix; Data analysis; Gaussian distribution; Hidden Markov models; Speech analysis; State estimation; Trajectory; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596209
Filename :
596209
Link To Document :
بازگشت