DocumentCode :
2697063
Title :
Addressing Channel Mismatch through Speaker Discriminative Transforms
Author :
Pelecanos, Jason ; Navratil, J. ; Ramaswamy, Ganesh N.
Author_Institution :
Human Language Technol. Dept., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
fYear :
2006
fDate :
28-30 June 2006
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents a discriminative criterion applied to Gaussian mixture models (GMMs) to reduce handset mismatch. The criterion is related to the log-likelihood-ratio (LLR) scoring approach commonly used in GMMs for speaker recognition. The algorithm attempts to perform a direct mapping of features from one channel type to an assumed undistorted target channel but with the goal of maximizing speaker discrimination using the transform. The transform attempts to maximize the posterior probability of a group of speaker models given their corresponding speech observations recorded on a different channel
Keywords :
Gaussian distribution; probability; speaker recognition; transforms; wireless channels; GMM; Gaussian mixture model; LLR scoring approach; channel mismatch addressing; direct mapping; log-likelihood-ratio; posterior probability; speaker discriminative transform; speaker recognition; Biometrics; Degradation; Humans; Natural languages; Performance evaluation; Speaker recognition; Speech recognition; Speech synthesis; Telephone sets; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
Type :
conf
DOI :
10.1109/ODYSSEY.2006.248111
Filename :
4013528
Link To Document :
بازگشت