DocumentCode :
3325838
Title :
Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllable-based HMM
Author :
Nakagawa, Seiichi ; Zhang, Wei ; Takahashi, Mitsuo
Author_Institution :
Dept. of Inf. & Comput. Sci., Toyohashi Univ. of Technol., Japan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We presented a new text-independent speaker recognition method by combining a speaker-specific Gaussian mixture model (GMM) with a syllable-based HMM adapted by MLLR or MAP (S. Nakagawa et al., Proc. Eurospeech, p.3017-3020, 2003). The robustness of this speaker recognition method for speaking style changes was evaluated in this paper. A speaker identification experiment, using an NTT database, which consists of sentences of data uttered at three speed modes (normal, fast and slow) by 35 Japanese speakers (22 males and 13 females) on five sessions over ten months, was conducted. Each speaker uttered only 5 training utterances (about 20 seconds in total). We obtained an accuracy of 98.8% for text-independent speaker identification for three speaking style modes (normal, fast, slow) by using a short test utterance (about 4 seconds). This result was superior to conventional methods for the same database. We show that the attractive result was brought from the compensational effect between speaker specific GMM and speaker adapted syllable based HMM.
Keywords :
Gaussian distribution; hidden Markov models; speaker recognition; GMM/HMM compensational effect; MAP; MLLR; identification accuracy; speaker adapted syllable-based HMM; speaker style change recognition robustness; speaker-specific GMM; speaker-specific Gaussian mixture model; text-independent speaker recognition; training utterances; varying speed uttered sentence data; Databases; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Probability; Random variables; Robustness; Speaker recognition; Speech recognition; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325927
Filename :
1325927
Link To Document :
بازگشت