Title :
Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM´s
Author :
Matsui, Tomoko ; Furui, Sadaoki
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
fDate :
7/1/1994 12:00:00 AM
Abstract :
This paper compares a VQ (vector quantization)-distortion-based speaker recognition method and discrete/continuous ergodic HMM (hidden Markov model)-based ones, especially from the viewpoint of robustness against utterance variations. The authors show that a continuous ergodic HMM is as robust as a VQ-distortion method when enough data is available and that a continuous ergodic HMM is far superior to a discrete ergodic HMM. They also show that the information on transitions between different states is ineffective for text-independent speaker recognition. Therefore, the speaker recognition rates using a continuous ergodic HMM are strongly correlated with the total number of mixtures irrespective of the number of states
Keywords :
hidden Markov models; speech recognition; vector quantisation; VQ-distortion; continuous HMM; continuous ergodic HMM; discrete HMM; discrete ergodic HMM model; distortion-based speaker recognition; hidden Markov model; speaker recognition rates; state transitions; text-independent speaker recognition; utterance variations robustness; vector quantization; Covariance matrix; Hidden Markov models; Humans; Parameter estimation; Performance analysis; Robustness; Speaker recognition; Speech recognition; Testing; Vectors;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on