Title :
Addressing Channel Mismatch through Speaker Discriminative Transforms
Author :
Pelecanos, Jason ; Navratil, J. ; Ramaswamy, Ganesh N.
Author_Institution :
Human Language Technol. Dept., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
Abstract :
This paper presents a discriminative criterion applied to Gaussian mixture models (GMMs) to reduce handset mismatch. The criterion is related to the log-likelihood-ratio (LLR) scoring approach commonly used in GMMs for speaker recognition. The algorithm attempts to perform a direct mapping of features from one channel type to an assumed undistorted target channel but with the goal of maximizing speaker discrimination using the transform. The transform attempts to maximize the posterior probability of a group of speaker models given their corresponding speech observations recorded on a different channel
Keywords :
Gaussian distribution; probability; speaker recognition; transforms; wireless channels; GMM; Gaussian mixture model; LLR scoring approach; channel mismatch addressing; direct mapping; log-likelihood-ratio; posterior probability; speaker discriminative transform; speaker recognition; Biometrics; Degradation; Humans; Natural languages; Performance evaluation; Speaker recognition; Speech recognition; Speech synthesis; Telephone sets; Telephony;
Conference_Titel :
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location :
San Juan
Print_ISBN :
1-424400471-1
Electronic_ISBN :
1-4244-0472-X
DOI :
10.1109/ODYSSEY.2006.248111