DocumentCode :
1224345
Title :
A Syllable Lattice Approach to Speaker Verification
Author :
Jin, Minho ; Soong, Frank K. ; Yoo, Chang D.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon
Volume :
15
Issue :
8
fYear :
2007
Firstpage :
2476
Lastpage :
2484
Abstract :
This paper proposes a syllable-lattice-based speaker verification algorithm for Mandarin Chinese input. For each speech utterance, a syllable lattice is generated with a speaker-independent large-vocabulary continuous speech recognition system in free syllable decoding. The verification decision is made based upon the likelihood ratio between a target-speaker model and a speaker-independent background model, computed on the decoded syllable lattice. The likelihood function is calculated efficiently in a forward algorithm by considering all paths in the lattice. The proposed algorithm was evaluated using a Mandarin Chinese database, where 1832 true and 26 250 impostor trials were recorded by 19 target speakers and 180 impostors. The average duration of each trial is 2 s long without silence. The target-speaker model was adapted from the speaker-independent background model using enrollment data of two minutes with silence. The proposed algorithm achieved an equal-error rate of 0.857% which is better than 1.21% of the hidden Markov model-based speaker verification algorithm without using syllable lattices. The equal-error rate was further reduced to 0.617% by incorporating the Goussian mixture model-universal background model algorithm with 2048 Gaussian kernels whose equal error rate is 0.990%.
Keywords :
Gaussian processes; hidden Markov models; natural language processing; speaker recognition; speech coding; Gaussian mixture model; Mandarin Chinese input; continuous speech recognition system; free syllable decoding; hidden Markov model; likelihood function; speaker verification algorithm; speaker-independent background model; speech utterance; syllable lattice approach; target-speaker model; Databases; Decoding; Error analysis; Hidden Markov models; Kernel; Lattices; Loudspeakers; Speaker recognition; Speech recognition; Testing; Lattice-based speaker adaptation; Mandarin Chinese; lattice rescoring; speaker recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.906181
Filename :
4317565
Link To Document :
بازگشت