DocumentCode :
1109841
Title :
Comparison of text-independent speaker recognition methods using VQ-distortion and discrete/continuous HMM´s
Author :
Matsui, Tomoko ; Furui, Sadaoki
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
Volume :
2
Issue :
3
fYear :
1994
fDate :
7/1/1994 12:00:00 AM
Firstpage :
456
Lastpage :
459
Abstract :
This paper compares a VQ (vector quantization)-distortion-based speaker recognition method and discrete/continuous ergodic HMM (hidden Markov model)-based ones, especially from the viewpoint of robustness against utterance variations. The authors show that a continuous ergodic HMM is as robust as a VQ-distortion method when enough data is available and that a continuous ergodic HMM is far superior to a discrete ergodic HMM. They also show that the information on transitions between different states is ineffective for text-independent speaker recognition. Therefore, the speaker recognition rates using a continuous ergodic HMM are strongly correlated with the total number of mixtures irrespective of the number of states
Keywords :
hidden Markov models; speech recognition; vector quantisation; VQ-distortion; continuous HMM; continuous ergodic HMM; discrete HMM; discrete ergodic HMM model; distortion-based speaker recognition; hidden Markov model; speaker recognition rates; state transitions; text-independent speaker recognition; utterance variations robustness; vector quantization; Covariance matrix; Hidden Markov models; Humans; Parameter estimation; Performance analysis; Robustness; Speaker recognition; Speech recognition; Testing; Vectors;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.294363
Filename :
294363
Link To Document :
بازگشت